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ABSTRACT 

Scaling is carried out in an effort to increase the 
comparability of scores obtained on different tests. This study 
explored the relationships between College Board Achievement Test 
scores and potential scaling covariates for various subgroups of the 
test-taking population with the goal of providing several 
alternatives to traditionally used scaling procedures. The 
alternatives vary components such as scaling covariates, scaling 
samples, and characteristics of the hypothetical reference group. The 
traditional method of scaling the Achievement Tests uses scores on 
the Scholastic Aptitude Test (SAT) verbal and mathematics tests as 
seal ing covariates for all 14 achievement tests . To determine 
additional covariates that might be used, correlations of test scores 
with 17 potential covariates were examined, and stepwise regressions 
were carried out. As a result, the Achievement Tests were grouped 
into five clusters. Two alternative scaling procedures were proposed. 
The first consists of scaling the tests within the five independent 
clusters; and the second consists of a two-stage process of scaling 
within the clusters independently and using the results of the 
initial scaling to scale the tests again with SAT verbal and 
mathematics scores as coumon covariates. (Contains 9 figures, 12 
tables, and 3 references.) (SLD) 
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EXECUTIVE SUMMARY 



Although the statistical processes of scaling and test equating are 
mathematically verv similar, the purposes, and end results, of the two 
procedures are often quite different. This is certainly the case for the 
Admissions Testing Program (ATP) Achievement Tests. 

Test equating is the method that is most often used with the Achievement 
Tests to maintain form-to-form comparabi litv of test scores. Every time a new 
form of an Achievement Test is introduced (for example, a new form of the 
Biology Test) , scores on the new form are equated to scores on a previous form 
of the test. This tvpe of equating can be carried out quite rigorously and 
assures that scores obtained by a student v^ho may have taken a form of the 
Blologv test In the spring of the vear can be compared with confidence to scores 
obtained bv a student who, perhaps, took a different form of the Biology Test in 
the fall of the vear. Form-to-form equating is essential if scores obtained on 
different forms of the same test are to be compared for college admissions 
purposes . 

Form-to-form equating also plays an Important role when scores on the same 
test are to be evaluated over a long period of time, perhaps in an effort to 
examine trends in test scores for large populations. Similarly, form-to-form 
equating is essential if scores on a particular test are to be used for 
placement purposes. For example, cut scores used to place students into classes 
at differing levels may be developed and validated using a particular form of a 
test and then used when students taking alternate forms of the test are tested 
for placement purposes. If the placement decision is to be effective, scores 
obtained on the <^orms of the test must be comparable to scores obtained on the 
test form, or forms, that were used to initially establish and validate the 
placement cut score. 
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Scaling, on the other hand, as It is used for ATP Achievement Tests, Is 
carried out in an effort to increase comparability of scores obtained on 
different tests, e.g., scores obtained on the Biology Test with those obtained 
on the Chemistry Test. Hence, if it is necessary for college admissions 
purposes to compare scores obtained by students taking a Biology Test with those 
obtained by students taking a Chemistry Test, scaling becomes Important. 

An issue that has concerned those interested in maintaining both the across 
test and form-to-fom comparabilitv of Achievement Test scores is that it has 
been very difficult to perform both of those procedures simultaneously. It has 
been possible to maintain the within test comparability required for placement 
and some admissions decisions as well as trend analyses by form-to-form 
equating. However, periodic scalings^ of the tests, designed to Increase across 
test comparability required for admissions decisions, has interrupted the 
form-to-form comparability which is necessary for the previously mentioned 
placement decisions, some admissions decisions, and trend analyses. 

Whether or not it is iT^ore Important to scale the tests to Increase across 
test comparabilitv of scores or to pursue rigorous form-to-form equating is not 
so much a statistical matter as it is a matter of program priorities and test 
use. k^at is a statistical concern is the fact that the procedure used to scale 
the tests and, hence, to maximize across test comparability fat the expense of 
form-to-form comparabilitv) has been questioned. It is this latter concern that 
is the focus of the study summarized in this report. 

The purpose of the study described in this report was to explore the 
relationships between College Board Achievement Test scores and potential 
scaling covarlates for various subgroups of the test taking population. It was 
hypothesized that such an exploration would lead to the following: 



Periodic scalings of the ATP Achievement Tests have not been carried out since 
1980 . ^ 
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o The selection of additional scaling covariates that might provide 
Improved scaling results for those tests that do not provide scores 
correlating highly with SAT-V and/or SAT-M scores; 

o The review and possible respeclf icatlon of the sample of students that 
are used to scale the tests; and 

o The review and possible respeclf icatlon of the hypothetical scaling 
population • 

The eoal of the present study was to provide several alternatives to the 
scaling procedures traditionally used for the Achievement Tests. The 
alternatives will, bv their nature, varv components such as scaling covariates, 
scaling samples, and characteristics of the hypothetical reference group. 

A brief summary of the results of the study follows. First, the results of 
the investigation of the characteristics of the scaling sample are discussed. 
Second, the choice of additional scaling covarlates and their relationship to 
clusters of tests are summarized. Third, the results of the examination of the 
characteristics of the hypothetical reference group are discussed. 

Characteristics of the Scaling Sample 

The traditional method of scaling the Achievement Tests emplovs a scaling 
sample with the following characteristics. The sample is based on high school 
lunlors and seniors. Juniors are selected from those students who take the 
tests at the May and June administrations and seniors are selected from students 
who take the tests at the November, December, and January administrations. 

Several trends were observed in summarizing the results of a number of grade 
level analvses done for the study. First, it appears, as has traditionally been 
the situation, that the tests are taken predominantly by high school seniors at 
the November, December, and January administrations and by lunlors at the Mav 
and June administrations. There are, however, some exceptions to this rule. 
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For one, the Biology Test Is taken predominately by sophomores at the June 
administration. Also, the Tiiterature, Physics, and Mathematics Level II Tests 
are taken by almost equal numbers of juniors and seniors at the May 
administration . 

One question is, given the fact that the basic test taking patterns (seniors 
taking the tests in the fall and juniors in the spring) do not seem to have 
changed greatly over the past several decades, is there a reason for revising 
the specifications that are presently used to draw the scaling samples? 

The results of the analyses conducted for the study Indicated that the 
practice of sampling high school seniors taking the tests in November, December, 
and January and juniors taking the tests in May and June provides a viable 
sampling option. A potential alternative to this practice is related to whether 
to include high school sophomores taking the tests in June. It was decided that 
an effort should be made to Include sophomores is the scaling sample if 
additional analyses indicate that scores on missing covariates can be estimated 
adequately and if the regressions show that Achievement Test scores obtained by 
this group have a similar relationship to the relevant covariates as that 
displayed by scores obtained by other sampling groups. 

Additional Scaling Covariates 

The traditional method of scaling the Achievement Tests uses scores on SAT-V 
and SAT-M as scaling covarlates for all 14 tests. In addition, semesters of 
study (as assessed by Achievement Test Background Questionnaire responses) is 
used as a covarlate for scaling foreign language tests. 

Both Braun and Tucker (see Dorans, 1985) have pointed to problems with the 
traditional scaling method that are related to choice of covarlates. Basically, 
the results of both their studies indicated that when ability has a ma1or Impact 
on test selection (as it does with the Achievement Tests), there is a clear need 
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to rescale the tests. The quality of this scaling will depend, however, on how 
highly the covarlates used to rescale the tests correlate with Achievement Test 
performance . 

In order to determine what additional covarlates might he added to the model 
used to scale the Achievement '’'ests, correlations of test scores with 17 
potential covarlates were examined. And, more Importantlv, stepwise regressions 
were carried out. The stepwise regressions were used to answer questions such 
as: If SAT-V and SAT-M. scores are used to scale Achievement Test scores, will 
the scaling Improve If semesters of study are added as an additional covarlate? 

In addition, the results of the stepwise regression analvses were compared to 
determine If several tests could be clustered together, l.e., tests within a 
cluster show a similar relationship to a common core of covarlates and, hence, 
can all be rescaled using the covarlates. It should be noted that. In choosing 
tests to form the clusters, practical considerations were taken into account, 
l.e., efforts were made to keen tests together that formed logical clusters such 
as the science tests, the language tests, etc. 

As a result of the regression analyses, the Achievement Tests were organized 
Into five clusters. Cluster 1, referred to as the English Cluster, contains the 
English Composition Test and the Literature Test. Cluster 2, referred to as the 
Flstorv Cluster, contains the American History and Social Studies Test, and the 
European Hlstorv and World Cultures Test. All tests In both the clusters 
correlate hlghlv with SAT-V scaled scores, reasonably highly with SAT-M scaled 
scores, and moderately with grades In the respective subiect areas. The 
European History and World Cultures Test scores show the lowest correlation with 
these covarlates. This is particularly true of scores obtained at the December 
administration of the test. 
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Cluster 3, the Mathematics Cluster, contains the Mathematics Level I and It 
Tests. Proposed covariates for these two tests are SAT-M scaled scores and 
Achievement Test Backj^round Questionnaire responses assessing amount of study in 
mathematics. High correlations with SAT-M scores and moderate correlations with 
Background Questionnaire responses are displayed by the Mathematics Level T and 
Mathematics Level IT data. 

Proposed covarlates for Cluster 4, the Science Cluster, containing the 
Biology, Chemistry, and Physics Tests, are SAT-V scaled scores, SAT-M scaled 
scores, and amount of course work in biology and sciences or amount of course 
work in phvsical sciences, respectively. The Chemistry and Physics Tests show 
correlations with the covariates that are slightly more similar to each other 
than thev are to the correlations displayed bv the Biology Test scores. Scores 
obtained on the Biology Test have a tendency to be a little more highly 
correlated with SAT-^^ scores and a little less highly correlated with SAT-M 
scores than scores obtained on the other two tests In the cluster. 

Cluster 5 contains three of the five language tests, French, Spanish and 
Latin. Results of the analyses were not amenable to clustering either the 
Hebrew Test or the German Test. The additional covariate proposed for the 
cluster, along with those used bv the traditional procedure (SAT-V and SAT-M 
scaled scores and Achievement Test Background Questionnaire responses assessing 
amount of study in a language). Is grades in foreign languages. All of the 
tests show a fairly strong correlation with this covarlate. 

To summarize, the tests were sorted into five scaling clusters based on the 
results of a stepwise regression analyses, as well as on practical 
considerations. The results of the clusterings were confirmed by an examination 
of the correlations of test score with selected covariates. It Is suggested 
that, based on the results of this portion of the study, two alternative scaling 
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procedures be considered. The first procedure consists of scaling the tests 
within the five Independent clusters. No attempt would be made to establish any 
type of relationship among the scores obtained on tests that are members of 
different clusters. The second procedure consists of a two-stage process. The 
first stage consists of scaling the tests within the clusters independently. 

The second stage would use the results of the initial scaling and scale the 
tests again, using SAT-V and SAT-M scores as common covariates for all lA tests. 

Hypothetical Scaling Population 

The hypothetical reference population that is used In the traditional 
Achievement Test scaling procedure Is assumed to have a scaled score mean of 500 
on SAT-V and SAT-M, a scaled score standard deviation of 100 on both these 
variables, and a correlation between scores obtained on SAT-V and SAT-M of .60. 

Tt has been pointed out bv Rraun (see Dorans, 1985) that these values influence 
the scaling results. The results of Braun’s studies Indicated that choice of 
reference population is of critical importance. Braun found that different 
reference groups produced different Achievement Test scale alignments. 

The aralvses corducted for this part of the study indicated that, in 
general, SAT-v scaled score means for the total groups that take the Achievement 
Tests at the five administrations range between roughly 500 and 575. SAT-M 
scaled score means obtained by the same groups range, roughly, between 5?5 and 
675. The total group correlation between scores on these tv^o tests range 
between .AA and .69 with the maiorltv of the correlation coefficients falling 
somewhere in the vicinity of .55. Furthermore, examination of the information 
provided regarding scaled score standard deviations Indicates that SAT— V scaled 
score standard deviations obtained bv the total groups taking the lA tests range 
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It is fairly clear, from the inforaation presented above, that SAT-V and 
SAT-M scores obtained by the groups taking the 14 tests are quite diverse. 
Furthermore, it is clear that a hypothetical population with means and standard 
deviations of 500 and 100, respectively, on SAT-V and SAT-M has little empirical 
basis (although this population does have historical importance). The 
correlation between SAT-V and SAT-M scores that is presently used for the 
hypothetical population (.60) does, however, have some empirical basis. 

A num’^er of possibilities exist for alternatives to the way in which the 
hypothetical reference group is currently specified. Given the diversity of the 
scores on the covariates obtained by the various groups, an empirical approach 
might be the most feasible. This approach could be one that involves pooling 
scores on the various covariates and using the pooled means, standard 
deviations, etc., as the values to specify for the hypothetical reference group. 

One empirical approach is to pool data obtained for the common covariates 
that are used within a cluster, i.e., five hypothetical reference populations 
would be established. Next, if a two stage scaling procedure were used, data 
could be pooled across the covariates common to the 14 tests (SAT-V and SAT-M 
scores) and used to specifv the hypothetical reference group for the second 
stage of the scaling. 

The alternative scaling procedures suggested above represent, to some 
extent, a compromise between the traditional scaling method and form-to-form 
equating. Because the procedures Involve clustering tests that have similar 
relationships to a common set of covarlates, some of the problems with the 
traditional scaling method, alluded to by Tucker and Braun, should be reduced. 
The suggested procedures represent changes in methodology that should lead to 
greater comparability of scores on tests grouped within clusters, but less 
comparability on scores obtained on tests that are members of different 
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Vclusters. The procedures that involve two-stage scaling represent a further 



Gompromise between a one-stage procedure and the traditional procedure. Because 



procedure involves a second stage scaling that uses common covariates ^or 



:^tbe 14 tests, scores obtained on tests that are members of different clusters 



^ight he considered roughly comparable. However, because of the problem of 
Slif ferent ial relationship of Achievement Test scores to SAT-V and SAT-M scores. 



™:some lack of comparability of the scales established across the clusters will 
"prol^bly be difficult to avoid. 

To summarize, based on empirical evidence gathered in this study, 
alternative scaling procedures have been suggested. These procedures represent 
a comnromise between the current scaling method and form-to-form equating. 

Which procedure offers maximum comparability of scores either within a cluster 
or across clusters, while at the same time minimizing scaling problems resulting 
from the differential relationship of scores to covariates, mav be a question 



that is amenable to future empirical investigation. 
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BACKGROUND 

l\^en the obiecffve forms of the College Board Achievement Tests were first 
introduced in about 1942 for operational use in admissions, the tests were 
scaled in such a way that the mean for the group choosing to take each of them 
was set at 500, and the standard deviation at 100. That is to say, the average 
of each group of candidates taking its test was made to appear equal to the 
average performance of each of the other groups of candidates taking their 
tests. Similarly with their standard deviations. As a consequence of this 
scaling design, the score a candidate received on a test was dependent on, among 
other things, the abilitv level of the group of examinees who took the 
particular test. For example, a candidate would appear more able if he or she 
took a test typically chosen bv less able examinees and less able if he or she 
chose a test typically taken by high ability examinees. Inasmuch as each 
candidate in the tested population was (appropriately) regarded as competitive 
with each of the other candidates in a contest for the same reward, l.e., 
admission to college, this state of affairs appeared to be~and Indeed, 
vas--unf air ; anv candidate who understood the design of the score scales, and 
wished to appear to be relatively knowledgeable in his/her field, could adopt 
the strategy, if they chose to, of selecting the Achievement Test normally taken 
by the least able candidates. In order to remove this element of unfairness, a 
scaling system was designed in the middle 1940^s to adjust the scales for the 
several Achievement Tests to reflect the level and dispersion of ability of the 
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candidates taking each test. For example, a test typically taken by a more able 
group of candidates was made to yield an average scaled score higher than 500, 
and a test typically taken by a less able group of candidates was made to vield 
an average scaled score lower than 500. 

The operational scale definition adopted to achieve this result was that the 
candidate of average ability, relative to a hypothetical aggregate of all 
candidates taking the College Board tests, would earn a score of 500 regardless 
of the Achievement Test that he or she chose to take; also, that the dispersion 
of scaled scores for this hypothetical aggregate population would be defined 
with a standard deviation of 100. Thus, higher-ability Achievement Test groups 
would automatlcallv have higher means and, correspondingly, lower-ability 
Achievement Test groups would have lower m^ans. This definition was Implemented 
by defining "general ability" (more specifically, "general academic abilitv") as 
measured by the verbal and mathematical Scholastic Aptitude Tests (SAT-V and 
SAT-M) , respectively; and the degree to which the SAT-V and -M scores played a 
role In this operational definition was a direct function of the relevance of 
those tests for the particular Achievement Test in question, as measured bv the 
correlation of the SAT scores with the scores on the particular Achievement 
Test. A further adiustment was later Introduced into this svstem by adding 
semesters of study to the SAT-V and -M scores for scaling the language tests. 
This adiustment was Intended to account for the fact that some languages were 
typically studied for longer periods of time than were other languages. 

A distinction was made in earlier years, as it is made todav, between the 
efforts to provide optimum calibration or equivalence between one form and 
another within each of the various Achievement Tests (equating) and the efforts, 
as lust described, to maintain the optimum Interrelationships among the scales 
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for the various tests (scallnj^). The first of these is roughly analogous to the 
calibration of, say, successively manufactured scales for weighing obiects in 
order to insure that the same object will yield the same reading of weight in 
pounds, v/hether the particular weight scales in question were manufactured this 
past vear or whether they were manufactured 20 years ago. Here, we observe, the 
calibration is carried out as a study of the relation between two instruments 
that provide measurements in the same domain, namely weight or, in the case of 
the forms of the individual Achievement Tests — in German, in English Literature, 
in Phvslcs, etc. — forms that measure the same content domain but which may 
differ from one another in level of difficulty. 

The second of these, scaling, presents some of the same considerations that 
must be borne in mind when developing height-weight tables, for example. Unlike 
alternative Instruments used for taking measurements in the same domain, as in 
height or weight or temperature, the relation between measures in different 
domains, like height ys weight, is seen clearly to vary depending on the nature 
of the group on which the relationship is based, for example, whether the group 
consists of males or females; ectomorphs, mesomorphs, or endomorphs; or 
adolescents, voung adults, middle-aged, or elderly people. Clearly, a set cf 
relationships based on one of these types of groups will not necessarily apply 
to other groups. Similar considerations apply to the Achievement Tests. A set 
of relationships among the Achievement Tests based on one group of students, say 
largely male, will not apply to a group of students composed largely of females. 
Similarly, a set of relationships based on students who have been exposed to one 
type of curriculum efnphasis in a subject-matter area will not apply to students 
who have been exposed to another type of curriculum emphasis. Yet, it is the 
intent of the scaling effort to develop one set of interrelationships among the 
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scales for the different Achievement Tests that will apply at least reasonably 
well to all component subgroups of students. This is a task that is, 
admittedly, Impossible to execute rigorously with respect to all reasonable 
criteria. Yet, the alternative, to let each test scale be defined bv the group 
who chooses to take that test, is, for reasons already discussed, even less 
attractive . 

Since the introduction of the obiective Achievement Tests, it has become 
clear that they have been useful for two different, and — as it later became 
evident — somewhat conflicting contexts. One of these was the use of the tests 
in admission decisions, in which students choose to take different combinations 
of one, two, or three Achievement Tests at a single sitting. It was for this 
purpose that the scaling of the Achievement Tests in relation to one another was 
originally designed. As already indicated, although the methodology used to 
achieve this between-test comparability fell short of the ideal, it was the 
best, indeed the only defensible, methodology available. However, since the 
between-test comparability system designed at one point in time is likely to 
change with changes in population or changes in secondary school curricular 
emphasis — just as the height-weight relationship might change with a change in 
the population or with a change of dietary habits in the population — a 
far-sighted general plan for scaling, or establishing comparability, should 
normally provide for a periodic re-examlnatlon of the interrelationships among 
the tests, and an adjustment in the system, if that were found to be indicated. 

The second use of the Achievement Tests resided, and still resides, in the 
context of course placement. In this context, a university administration 
typically chooses a score level above which a student may be permitted to enter 
a more advanced level of instruction, and below which the student would be 
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required to take a more elementary, less demanding, or sometimes even a remedial 
course. In order for the university to maintain the same level of standards 
from one vear to the next, or to be able to change its standards deliberatelv 
and with full understanding of the desired consequences, it is essential that 
the meaning of the "passing,” or "admit," score be rigorously maintained from 
one vear to the next. This implies that within each test — Physics, European 
Historv and World Cultures, Mathematics Level IT, etc. — extreme effort be given 
to rigorous form-to-form equating. Tf, however, the svstem is designed to 
provide for optimum between— test comparabilitv , to be revised as necessarv 
whenever the existing comparabilitv is found to be out of date, then the 
within-test equating svstem is necessarilv weakened. The two 

purposes — within-test equivalence and between-test comparability are therefore 
ant ithe t lea 1 j thev cannot both he served simultaneously and equallv well. 

On the other hand, it is also true that the purposes of equivalence and 
comparability do make some of the same demands. Since college applicants may 
take an Achievement Test at any one of various different administrations, the 
forms of a test offered at the different administrations, in order to be 
directly comparable, must be well equated. Tf, however, the comparability 
between forms is changed, as it must be to be responsive to changing populations 
and changing curricula, the score equivalences between the forms of a given test 
will necessarily be disturbed. The result is that comparisons among individuals 
offering scores for admission on the same test, but from different 
administrations, will be affected. 

In general, random or systematic statistical errors in equating will have an 
effect on both course placement decisions and admissions decisions. It will be 
use f ul to review the general theory and method of equat Ing scores on alternate 
forms of the Achievement Tests to demonstrate this. 
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The conventional wav in which scores on these forms are equated is to carry 
over a subset of the items of which an old form of an Achievement Test is 
comprised and include the subset in the new form (the form to be equated). 
Inasmuch as the groups taking the different forms of a test differ, sometimes 
markedly, from one administration to another, it is necessary to use the 
information on these common items to make statistical adlustmcnts for the 
differences between the groups. For ideal equating, the groups taking the two 
forms should be large in size and only randomly different from one another. 

Since this is almost never the case in performing equating in the context of the 
Achievement Tests, it is Important that the common items constitute a 
substantial proportion of the test and that they represent the total test form 
in all respects — content, difficulty, and discriminating power — in brief, to 
serve as a parallel miniature of the parent form of the test. Even when these 
demands are met, it is inevitable that, because of the less-than-perfect methods 
of equating currently available to us, there will be random drift in the scales 
for the Achievement Tests, which will, in turn exert a disturbing effect on 
placement and admissions decisions. But when neither of these conditions are 
met — when the groups taking the different forms are not rai;domly equivalent, 
and, additionally, when common items cannot be drawn in sufficient numbers or in 
the desired manner, then systematic equating error necessarily results, with a 
consequent, sometimes appreciable, drift in the Achievement Test scales. 

In 1959, Professor Samuel Wilks of Princeton University was engaged to 
review the work of equating and scaling the SAT and the Achievement Tests* The 
scope of his review Included not only an examination of the particular methods 
in use at the time, it also Included an examination of the system, its 
philosophy, and its mode of implementation. One of the questions under 
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consideration in his review was that of the relative emphasis to be placed on 
the efforts to perpetuate the scales of the Individual Achievement Tests and 
provide undisturbed form-to-forra score equivalence (equating) versus the 
emphasis to be placed on the efforts to maintain the appropriate, up-to-date 
interrelationships among the scales for those tests (scaling). Professor Wilks 
recommended that the latter (scaling) should be the first order of business. 
Accordingly, a plan was instituted to rescale the tests each year, and to 
average the results of the rescaling with the existing between-test or equating 
relationships, expecting that the differences between the two efforts would in 
short time diminish to the vanishing point. This plan was put into effect, but 
the results turned out not as predicted; a review of the work of rescaling in 
the 1970’s revealed that the scaling operation was not moving consistently in 
one direction, but fluctuated from one rescaling to another, and by sizeable 
amounts. In the meantime, because of these efforts to develop the correct 
relationship among tests by periodic rescaling, the existing form-to— form 
relationships were— —as anticipated — disturbed; and this caused some concern in 
the universities who depended on a constant within— test scale structure for the 
effective use of the tests in course placement, and in some Instances, college 
admissions. 

The model used bv Wilks ^'and used to scale the Achievement Tests until 1980, 
when all scaling was halted) for defining comparability among the several 
Achievement Tests specifies a base population for which the comparability 
applies to be one with a mean of 500 and standard deviation of 100 on both the 
SAT-V and SAT-M. In addition, It was originally specified, when this 
comparability was first Introduced in the 1940*s, that the correlation between 
SAT-V and SAT-M for that population would be ,40, a not unrealistic figure at 
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that time. More recent statistical summaries show that the correlations between 
Verbal and Math vary between .65 and .70, for the most part. To reflect this 
change in relationship between SAT-V and -M scores, the correlation between them 
that was used for the reference population in scaling was raised to .60 in the 
mid-1970*s. Moreover, the means and standard deviations for Verbal and Math are 
no longer 500 and 100. Verbal means based on a total year’s tested group are 
about 430, and Math means are about 475; the standard deviations are typically 
higher than 100, about 110 for Verbal and 120 for Math. However, the scale for 
each Achievement Test had historical significance in that it was based on a mean 
and standard deviation estimated for the same population for which the 
Scholastic Aptitude Test Verbal and Math scales were defined, the population for 
which those means were 500 and standard deviations, 100. In the case of the 
language Achievement Tests, an additional variable was used, supplementing the 
Verbal and Math scores, namely, the number of semesters for which candidates 
typically studied each language. 

On the face of it, the foregoing model seems to be an excellent conception 
for a testing program in which every candidate is asked to take the same core, 
test (the SAT), but in which each candidate is at liberty to take one or more 
optional tests (the Achievement Tests), chosen from a list available to him or 
her. However, there are several difficulties with the model which will be 
described in the following section of this report. 

PROBLEM AND PURPOSE 

In the early 1980’s, Henry Braun and Ledyard Tucker (see Dorans, 1985) 
conducted studies designed to investigate Achievement Test rescaling (as defined 
by Wilks, 1959) that used both real and simulated data. These studies were 
undertaken to gain a better understanding of how operational decisions affected 
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the outcomes of scaling. The effects of changing the definitions of the 
standard reference group and of changing the definition of the samples used for 
calculating the scaling ec|uations as well as the effects of various choices of 
covariates were given particular attention. 

One of Braun's studies was conducted using data from the December 1979 
A-chievement Test administration. In this study, Braun investigated, among other 
things, the effect of the choice of reference group on the rescaling results. 

(As mentioned previously, the method used until 1980 employed a hypothetical 
reference group with a mean of 500 and a standard deviation of 100 on SAT-V and 
SAT-M and a correlation between the scores on the two tests of .60). The 
results of Braun’s studies indicated that choice of reference group had more 
than historical importance. Braun found that different reference groups 
produced different Achievement Test scale alignments. It should be noted that 
the choice of a suitable reference group has been discussed for many years, but 
no fully satisfactorv way of redefining the standard reference group has been 
identified to date. 

The simulation studies carried out by Tucker and Braun (see Dorans, 1985, 
for a detailed description of these studies) both employed a simulation model 
designed by Tucker to approximate the Achievement Test scaling situation. 
Tucker's model consisted of essentially two inter-related components. One 
component modeled Achievement and Aptitude Test performance and the second. 
Achievement Test selection. A maximum of two Achievement Tests could be taken 
in Tucker's study. The results of Tucker's study demonstrated that (Dorans, 
1985, p. 17) 

o The set of candidates who take the same pair of Achievement Tests need not 
have the same means and standard deviations on two properly aligned scales 
for the two tests* This finding demonstrated that it would be unwise to 
rescale by attempting to set means and standard deviations equal for all 
existing subgroups of candidates that took two Achievement Test forms. 
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o To the extent that developed academic skill plays a significant role in 
the test selection process, then there will be differential selection 
biases introduced that affect the rescaling results. 

o To the extent that Achievement Tests correlate highly with their 

respective covarlates, the differential selection biases can be overcome 
in the rescaling process. 

As mentioned previously, Braun employed Tucker’s simulation model in his 
study, which was an expansion on Tucker’s work. I^ereas Tucker limited his 
simulees (simulated test takers) to a maximum of two Achievement Tests, Brauii 
allowed his simulees to take a maximum of three Achievement Tests. Br.aun 
simulated performance on eight hypothetical Achievement Tests, which varied in 
their relationships to the rescaling covarlates, SAT-V and SAT-M scores. Braun 
also used two reference populations to determine the rescaling equations; the 
traditionally used hvpothetical reference population (with scaled score mean on 
SAT--V and --M of 500 and standard deviation of 100) and the observed scaled score 
means and standard deviations of the December 1979 Achievement Test population. 
Braun’s results led him to the following conclusions: 

o As was indicated by his empirical studies, the choice of reference 
population can have a significant Impact on rescaling results. 

o Similar to Tucker’s conclusions, the presently used methodology 
works well if Achievement Test score? are highly related to SAT-V 
and SAT-M scores or if the student’s choice of Achievement Test is 
not primarily a function of his/her ability. 

This latter point should be expanded upon. Both Tucker and Braun’s results 
indicated that rescaling works well provided that ability has no Impact on the 
test selection process. However, as noted earlier, ability clearly does affect 
the test selection process. Only if ability did not affect the selection 
process would rescaling no longer be necessary. When ability has a maior Impact 
on test selection, there is a need for rescaling and the quality of the 
rescaling will depend upon how highly the covarlates (SAT-V and SAT-M) used to 
rescale the test correlate with Achievement Test performance. 
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To summarize, research results so far indicate that 1) because the test 

selection process is ability related, rescaling is most probablv necessary, and 

\ 

7 .^ the currently used methodoloKy is influenced by the choice of reference group 
and by the relationship between Achievement Test scores and the covarlates 
CSAT-V and SAT-M scores) that are presently used to rescale the tests. 

The purpose of the study described in this report was to explore the 
relationships between College Board Achievement Test scores and potential 
scaling covariates for various subgroups of the test taking population. It was 
expected that such an exploration would lead to the following: 



o The selection of additional scaling covarlates that might provide 
Improved scaling results for those tests that d not provide scores 
correlating highly with SAT-V and/or SAT-M scores; 

o The re-spec 3 f icat 1 on of the sample of students that are used to 

scale the test, i.e., such a re-specification might possibly lead to 
Achievement Test scores that show a higher correlation with selected 
scaling covariates and; 

o The re-spec5 f icatlon of the hvpothetical scaling population. As 
Braun and Tucker pointed out, the characteristics of the 
hypothetical population differentially affect the scales of the • 
tests. A change in specifications for this population might 
possibly provide scales that are more appropriate for some of the 
tests. 

The final goal of the present study was to provide several alternative 
scaling procedures for the Achievement Tests.* The procedures presented will be 
based on empirical evidence gathered in the study and will vary components such 
as scaling covarlates, scaling samples and characteristics of the hypothetical 
reference group. 

It was anticipated that all 14 tests would not be amenable to similar 
treatment: therefore, in presenting results, the tests will be clustered and 
alternative procedures will be specified for each cluster. 
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METHODOLOGY 

Description of the Tests 

The 14 Achievement Tests fall into five general subject areas: 

English 

English Composition (two versions: all-multiple choice and multiple- 

choice with essay) 

Literature 

Foreign Languages 
French 
German 
Hebrew 
Latin 
Spanish 

History and Social Studies 

American History and Social Studies 
European History and World Cultures 

Mathematics 

Mathematics Level I 
Mathematics Level II 

Sciences 

Biology 

Chemistry 

Physics 

All the Achievement Tests take one hour of testing time, and consist 
entirely of multiple-choice questions except the English Composition Test with 
Essay, which consists of a 20 minute essay and 40 minutes of multiple-choice 
questions* The tests vary in content as well as in the number of multiple 
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choice questions they contain. The reader Is referred to Taking the Achievement 
Tests , for a detailed description of test content. The approximate number of 
questions contained In each test form Is listed In the table below. 



Test 



Approximate Number 
of Questions 



English Composition with Essay 

English Composition without Essay 

Literature 

French 

German 

Hebrew 

T-atln 

Spanish 

American History and Social Studies 

European History and World Cultures 

Mathematics Level I 

Mathematics Level It 

Biology 

Chemistry 

Physics 



70 multiple choice 
Items plus one essay 
90 
60 
85 
80 
90 
70 
85 
95 
95 
50 
50 
95 
85 
75 



Data Collection 

The Achievement Test data used to provide a base for this study was obtained 
from tapes used to select new form samples for operational equating purposes. 
These tapes are prepared when no less than sixty percent of the total volume of 
answer sheets from a particular administration have been scanned and scored. 

All cases available on the tapes were used to form the data base for this study. 
Tn no Instance did the scaled score mean of the group used for the study differ 



by more than a few points from the total population of examinees taking the test 



at a specific administration. 

The following Achievement Test administrations were used for the study. 
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o November 
o December 
o January 
o May^ 
o June 



1983, 1984 

1983, 1984 

1984, 1985 
1984, 1985 
1984, 1985 



Data from the 1983 and 1984, or 1984 and 1985 administrations were 
collapsed and treated as though they came from the same administration, i.e,, 
all analyses were carried out on a total of five administration groups. 

Scholastic Aptitude Test (SAT) scaled scores used for this study were the 
most recent scores obtained by candidates during the period from March 
1983-January 1984 and the period March 1984-January 1985 (depending upon whether 
candidates took the Achievement Tests with the 1983-84 or 1984-85 cohort). 

Analyses 

The analyses consisted of obtaining, for each Achievement Test, counts, 
distributions, summary statistics, and intercorrelations for the following 
variables: 

o Achievement Test Scaled Score 
o SAT-V Scaled Score 
o SAT-M Scaled Score 



It should be noted that, at the time data were collected for this study, the 
small volume tests, European History and World Cultures, German, Hebrew, and 
Latin, were given only at the December and May administrations.* Beginning with 
testing year 1986-87, these tests are offered only at the December and June 
administrations. 
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o Achievement Test Backgroud Questionnaire responses (for the 
French, German, Latin, Physics, Mathematics Level I and 
Mathematics Level II tests) 

o Student Description Questionnaire (SDQ) responses to questions 
5-17, 23, 24, 39, 40, 43.^ 

The analyses were performed for each test bv administration bv grade level 
combination . 

Several points require noting. First, for this exploratory study, it was 
decided to analyze a large subset of available variables without particular 
regard to their suitability for use in operational scaling. The purpose was to 
assess how much the correlation between Achievement Test scores and the 
covariates could be increased and to identify the covariates which were most 
useful for enhancing the correlation. (The reader is referred to Tables 1 and 2 
for a detailed description of the variables.) Second, since the data were 
collected for this study, a new Student Descriptive Questionnaire, as well as 
additional Achievement Test Background Questionnaires have been introduced in 
the Admissions Testing Program. The results of the study are limited in that 
data from those new questionnaires were not available for analyses. 

In addition to the counts, summary statistics, and intercorrelations, 
covariate selection (stepwise regression) analyses were also carried out for 
each test bv administration by grade level combination. The regression 
procedure used is referred to as the Maximum Improvement Technique (MAXR) and 
is available as part of the SAS Institute Statistical Analyses System (1982). 

R^ refers to the squared multiple correlation between any two or more variables. 
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SDQ auestlons 24, 39, 40 and 43 were used in the regression analyses only. 
StSdLts fill out the SDQ when they register to take the 
Tests. They have an opportunity to update information collected y 
questionnaire whenever they register for another ATP test. 
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The Maximum R Improvement Technique Is considered superior to a 
conventional stepwise technique and almost as good as all possible regression 
solutions. Unlike most stepwise procedures, this technique does not settle on a 
single model. Instead, the procedure tries to find the best one variable model, 
the best two variable model, and so forth. The MAXR method begins by finding 
the one variable model producing the highest R^. Then another variable, the 
one that yields the greatest Increase In R^, Is added. Once the two variable 
model Is obtained, each of the variables In the model Is compared to each 
variable not In the model. For each comparison, MAXR determines If removing one 
variable and replacing It with the other variable Increases R^. After comparing 
all possible switches, the one that produces the largest Increase In R^ Is made. 

Comparison begins again, and the process continues until MAXR finds that no 

2 

switch could Increase R • The difference between conventional stepwise 

2 

techniques and the Maximum R‘ Improvement Method Is that. In the Maximum r’ 
Improvement Method, all switches are evaluated before any particular switch Is 
made. In conventional stepwise procedures the "worst” variable may be removed 
without considering what adding the "best" remaining variable might accomplish. 
For the purposes of this report, only the results at the analyses producing the 
best two—, three—, four—, and five— variable models are reported. 

Several points should be made regarding the type of regression analysis 
carried out for this study. First, the regression procedure could have been 
controlled In such a way that the covarlates historically used to scale the 
tests (SAT-V, SAT-M, and In some cases, semesters of study) were automatically 
considered as part of the model. The computer program would then have searched 
for additional covarlates that Increased R above that observed for the 
covarlates traditionally used. The researchers chose not to do this for the 
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present study. Instead, they conducted a completely exploratory analysis, 

2 

resulting in a choice of covariates that maximized R for a particular data set. 

Secondly, like all stepwise regression procedures, the MAXR procedure is 
sample dependent, i.e., ideally, for the most generalizable results, the 
analyses should be carried out using the population of interest or a 
representative sample of this population. The current analysis was carried out 
using all available cases with scores on all variables of interest. This 
constitutes a very large sample of Achievement Test takers; however, the sample 
may not be representative of the total population because, as mentioned 
previously, it was selected when approximately 60% of the answer sheets for a 
particular administration were scored and because of the restriction that 
examinees have scores on all variables. 

It is Important to note that the actual groups used to carrv out the 
correlational, regression, and analyses of summary statistics varied according 
to the type of analysis done. In order to provide the most representative 
results, all available cases were used to compute summary statistics for the 
various test scores. However, because not all students took the same 
Achievement Tests and because not all students have SAT scores, samples used to 
compute these summary statistics varied. 

The correlational analyses were carried out using all available cases with 
scores on the two variables that were paired. Thus, sample size varies from 
pairing to pairing. Finally, as mentioned above, the regression analyses were 
carried out using only those examinees who had scores on all the variables of 
interest. Consequently, sample size varies from one regression analvsis to 
another. 
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Table 1 contains brief descriptions of the variables used as Achievement 
Test covarlates in the correlation and regression analyses* As seen in Table 1, 
variables 1 and 2 are SAT-V and SAT-M scaled scores respectively* Test booklets 
containing SAT forms, such as those used In this study, consist of six 30-mlnute 
sections: two SAT-V sections, two SAT-M sections, one Test of Standard Written 
English, and one experimental section. The two SAT-V sections contain a total 
of 85 five-choice Items compose of 25 antonyms, 20 analogies, 15 sentence 
completions and 5 reading passages each of which is followed by five Items based 
on the passage. The two SAT-M sections contain a total of 60 Items composed of 
40 five-choice regular mathematics Items and 20 four-choice quantitative 
comparisons Items . 



Insert Table 1 about here 

Variables 3-16 and 18-21, listed In Table 1, refer to questions on the 
Student Description Questionnaire (SDQ) . The actual texts of the questions used 
appear In Table 2, Variable 17 Is amount of training in a subject as assessed 
by the Background Questionnaires that were administered with the Achievement 
Tests. Only the following tests contain a Background Questionnaire: 

Mathematics Level T and IT, Physics, French, German, Spanish and Latin. The 
Hebrew Test also contains a questionnaire, but the Hebrew Test Is In the process 
of being extensively revised, so data from the Hebrew Test Queslonnalre were not 
evaluated for the study. Table 3 contains the actual text of the 
questionnaires . 



Insert Tables 2 and 3 about here 
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Finally, it was necessary to recode responses to the Student Descriptive 
Questionnaire Items as well as responses to the Achievement Test Background 
Questionnaires prior to analyses. The Information given In Table 4 describes 
how the recoding was carried out. 

Insert Table 4 about here 



RESULTS 

It is important for the reader to keep in mind the purposes of the study as 
the results of the analyses are reviewed. One purpose of the study was to 
consider respeclficatlon of the hypothetical reference population. SAT scaled 
scores were collected and summarized by grade level and administration for the 
groups taking the 14 Achievement Tests. It was expected that examination of 
these data would lead to confirmation or respecification of the manner In which 
the hypothetical scaling population Is specified. 

Achievement Test scaled scores obtained by grade level groups taking the 14 
Achievement Tests across the five administrations were also summarized. The 
data were used to fulfill a second purpose of the study; l.e., examination of 
the Achievement Test sample used for scaling purposes. It was expected that an 
examination of these data would lead to a confirmation or respeclf Icatlon of the 
manner in which samples of Achievement Test examinees are traditionally selected 
for scaling studies. 

Finally, correlational analyses and regression analyses were conducted to 
examine the relationship between Achievement Test scores and potential scaling 
covariates. The results of these studies were used to develop clusters of 
Achievement Test scores that had similar relationships to subsets of the 
covarlates investigated in the study. 
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Thft results of the analyses are summarized in Tables 5-12 and Figures 1-9. 

Table 5 contains summarv statistics by grade level and administration for the 14 

tests. Summary statistics were computed for all available cases with 
Achievement Test scores. It should be noted, when reviewing the results found 
in Table 5, that four of the tests, European History and World Cultures, German, 

Hebrew and Latin were given only at the December and May Administrations. 

Insert Table 5 about here 

Table 6 presents summary statistics for SAT-V and SAT-M scaled scores for 
groups taking the 14 Achievement Tests. As was the case for the information 
displayed in Table 5, summaries were obtained by grade level for the five 
administrations. Two points should be noted when reviewing the data presented 
in Table 6. First, since not all students taking an Achievement Test at a 
particular administration have SAT scores on their records, the sample sizes 
reported in Table 6 are somewhat smaller than those reported in Table 5. 

Second, as was mentioned previously, four of the small volume tests were given 
only at the December and May Administrations. 

Insert Table 6 about here 

Table 7 contains correlation coefficients of Achievement Test scaled scores 
with the variables that were examined in the study as potential scaling 
covariates. The covariates are designated in the table as variable 1 fVar. 1) 
through variable 17 (Var. 17). As mentioned previously, titles and detailed 
descriptions for the variables are found in Tables 1-3. Table 4, as previously 
mentioned, contains information pertaining to the wav in which the variables 
were recoded prior to conducting the correlation and regression analyses. 
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The correlations presented In Table 7 were calculated for the total group 
taking the particular Achievement Test at a particular administration. Grade 
level Information was obtained as part of the study, but Is not presented in 
this report. Again, It should be noted that only the December and May 
administration summaries report results for all 14 tests. Further, Its 
Important to note that the number of candidates used to compute the correlations 
vary from pairing to pairing. Minimum sample sizes for each test are given In 
Table 7. Table I, found In the appendix, provides the correlation matrices 
fwlth accompaning sample sizes for each correlation coefficient) that are 
summarized In Table 7. 

Insert Table 7 about here 

The information presented In Table 8 summarizes the results of the stepwise 
regressions carried out for the study. The regression analyses were carried out 
for the 14 tests by grade level and administration date. As mentioned 
previously. Its important to note that the analyses were produced only for those 
examinees who had scores on all the variables of Interest. Table II, (found In 
the appendix) provides summary statistics and Intercorrelation matrices for the 
various analysis groups. Only those analyses based on sample sizes of 100 cases 
or more were Included In Table 8. 

Insert Table 8 about here 

The results of the analyses will be presented in the following manner: 

First, the results of all the analyses conducted for a particular test will be 
presented for each of the 14 tests. Then the results of the analyses will be 
compared across the various tests. 
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English Composition Test: Summary Statistics 

Table 5 contains summary statistics by grade level and administration for 
the English Composition Test. It Is Important to note that the total group 
Includes some candidates who did not specify grade level. 

It Is apparent from Table 5 that, as expected, the maiorlty of the test 
taking population are high school juniors and seniors, with seniors being more 
prevalent at the November, December and January administrations and iunlors 
representing the major portion of test takers at the May and June 
administrations. The English Composition Test Is not typically taken by high 
school freshmen. The only administration that contains a sizable number of 
freshmen candidates is the June Administration. High school sophomores also 
represent a very small part of the total candidate pool, at least for the 
November, December and January administrations. 

An examination of the Achievement Test scaled score means for the various 
groups (those for groups with sample sizes less than 25 are not Included In 
Table 5) indicates that high school sophomores have the highest scaled score 
mean of all grade levels taking the test at the November administration. This 
mean, however, is based on only 33 candidates and they are apparently very 
highly self selected. The scaled score means obtained by juniors and seniors 
taking the test at the November administration are quite similar, with the 
junior mean being just slight Iv higher. The highest scaled score mean obtained 
at the December administration (567) was again obtained by sophomore candidates 
(N=27). The junior mean for this administration (555) was considerably higher 
than that obtained by the high school seniors (521). 

For the January administration, the highest English Composition Test mean 
score was obtained by high school juniors, the next highest bv high school 
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sophomores, and the lowest by those students reporting their grade level as 
senior. A similar pattern is found in the mean scores obtained for the English 
Composition Test given at the May administration. High school juniors obtained 
the highest mean score (527) and seniors the lowest mean (485) of the three 
grade levels. Finally, the mean scores obtained by candidates taking the test 
at the June administration are rank ordered high to low as follows; juniors, 
sophomores, freshmen, and seniors. 

When one considers the achievement levels of the groups fas assessed by mean 
scores on the English Composition Test) across the five administrations, the 
following trends become clear. First, seniors produced the lowest English 
Composition Achievement Test mean scores at all five administrations. The mean 
scores obtained by seniors for the five administrations are rank ordered high to 
low as follows: November, December, January; followed by very similar May and 

June means. The pattern of mean scores for junior candidates Is somewhat 
different from that found for senior candidates, l.e., higher mean scores were 
obtained by candidates taking the test in December, with January scores being 
the next highest. Scores obtained by juniors at the November administration are 
the next highest, followed by those obtained by junior June candidates. The 
lowest English Composition Test mean scores obtained by juniors are for the May 
administration. It appears that, in general (with the exception of sophomore 
scores obtained at the November and December administrations) , juniors are the 
highest scoring group at each of the respective administrations. 

As mentioned above, sophomores produced the highest scaled score means for 
any grade level at both the November and December administrations. But, It 
needs to be kept in mind, that the sample sizes are quite small for sophomores 
taking the test at these two administrations. Also, the standard deviation of 
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the scores is quite high for the December sophomore group. It is interesting to 
note that sophomores produced higher scores than senior candidates at both the 
May and June administrations. Also interesting is the fact that freshmen taking 
the test in June obtained a higher scaled score mean than seniors who took the 
test at this same administration. 

Table 6 contains SAT-V and SAT-M summary statistics by grade level for 
candidates taking the English Composition Test at the five administrations. 

Again it is Important to note that "total” Includes those examinees who did not 
specify grade level. 

Considering only the junior and senior SAT-V and -M means for the November 
administration that are presented in Table 6, it is clear that the junior 
candidates are more able (as assessed by SAT-V and SAT-M scores) than the high 
school seniors. It should be noted that the correlation between SAT-V and SAT-M 
scores is lower for junior candidates (.51) than for senior candidates (.57). 
SAT-V and SAT-M scaled score means for juniors and seniors taking the English 
Composition test at the December Administration show similar trends to those 
exhibited by the November data» i.e., juniors obtained higher mean scores on 
both SAT-V and SAT-M than senior candidates. One difference between scores 
obtained at these two administrations can be noted in the correlation between 
SAT-V and SAT-M scores; this correlation is .55 for both groups. 

For the January administration, sample sizes are large enough to consider 
the mean scores obtained by high school sophomores as well as by juniors and 
seniors. As previously noted, high school juniors score higher than seniors on 
both SAT-V and SAT-M. Interestingly, the 26 sophomores whose scores are 
summarized for the January administration also score higher than the seniors who 
took the test at this administration. High school sophomores produced the 



O 



38 



- 25 - 



highest correlation between SAT-V and SAT-M scores (.77), and seniors the lowest 
correlation ( . 54) . 

The SAT-V and -M scaled scores summarized for the May administration 
indicate that juniors taking the test at this administration score higher on 
both variables than seniors. Sophomores also score higher than seniors on both 
SAT-V and SAT-M. Sophomore candidates score higher than junior candidates on 
SAT-M but lower on SAT-V. Scores for the sophomore candidates are based on a 
small sample (N=38) however, and have fairly large standard deviations (126 and 
122, respectively). Correlations between SAT-V and SAT-M scores range from .66 
for the sophomore candidates to .53 for the senior candidates. 

Perusal of the data obtained for the June group taking the English 
Composition Test (shown in Table 6) indicates that freshmen (N=36) obtained the 
highest SAT-V scaled score mean and an SAT-M scaled score mean equal to that of 
the sophomores. Roth freshmen and sophomores obtained higher SAT-V and SAT-M 
scaled score means than the junior or senior candidates. As was the case for 
the previous four administrations, juniors appear to be more able (as assessed 
by SAT-V and SAT-M scores) than seniors. The correlations between SAT-V and 
SAT-M scores range from a low of .42 for freshmen candidates to a high of .58 
for juniors who took the test. 

The following trends in both SAT-V and SAT-M scores across the five 
administrations can be noted. First, considering only junior and senior 
candidates, scaled score means for SAT— V range from a low of 470 for May senior 
candidates to a high of 560 for juniors taking the test in December. Scaled 
score means for SAT-M range from a low of 533 for May senior candidates to a 
high of 630 for November juniors. Further, it should be noted that the means 
for these tests vary considerably across grade level and administration and in 
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most Instances, they are considerably above a scaled score mean of 500. 

Secondly, It Is important to note the relationship between SAT-V and SAT-M 
scaled scores, as indicated by the correlation between these scores, varies 
across grade levels and administrations. These correlations range from a high 
of .77 for sophomores taking the test in January to a low of ,42 for freshmen 
taking the test in June. In general, most of the correlation coefficients 
appear to range between .55 and .60. 

English Composition Test: Correlational Analyses 

Table 7 contains correlation coefficients for Achievement Test scores with 
16 of the 17 covariates for the total group taking the English Composition Test 
at the five administrations. It should be noted that correlations were computed 
for all examinees taking the test at a particular administration who had scores 
on both variables of interest. Thus, as mentioned previously, sample sizes vary 
from one pair of variables to another. Table I, found in the appendix, contains 
the correlation matrices summarized in Table 7. The matrices found in Table T 
list the sample size for each cell in the matrix. It should be noted that no 
correlations of English Composition Test score with variable 17 (Achievement 
Test Background Questionnaire responses) are given in Table 7. This is because 
no Background Questionnaire is administered with the English Compostion Test. 

Examination of the data given in Table 7 indicates that, for the November 
administration, correlations of the 16 covarlates with English Composition Test 
scores range from .01 for variable 7 (amount of high school course work in 
biology and the sciences) to .81 for variable 1 (SAT-V scaled scores). Variable 
2 (SAT-M scaled scores) has the second highest correlation with the English 
Composition Achievement Test score. Other variables that show a correlation 
with the English Composition Test scores of .30 or greater are variables 3, 10, 
11 and 13-16. (See Tables I and 2 for a description of the variables.) 



An examination of the data given In Table 7 for the group taking the test at 
the December administration Indicates a similar pattern to the one observed for 
the November correlations. The correlations with Achievement Test score range 
from .00 for variable 7 to .76 for variable 1. Again, SAT-M scaled scores 
^variable 2) offer the second highest correlation with English Composition Test 
scores. Other variables with correlations of .30 or greater are variables 3, 

10, 12, 13, and 15. In general, all the correlation coefficients appear to be 
slightly lower for data collected at the December administration than for those 
collected In November. 

Data obtained at the January administration shows similar trends to those 
obtained for the November and December administrations. SAT-V and SAT-M scaled 
scores (variables 1 and 2) correlate most highly with English Composition Test 
score. However, a number of other variables (variables 3, 10, 13, 15 and 16) 
show moderate relationships with the English Composition Test scores. 

The correlation coefficients observed for the May and June data are similar 
to those observed for the other administrations. As expected, SAT-V and SAT-M 
scaled scores are the two variables most highly correlated with English 
Composition Achievement Test score. For both administrations, variable 7 
(amount of high school course work In biology and the sciences) has the lowest 
correlation with English Composition score, whereas for both administrations 
(with the exception of SAT— V and SAT— M scores) , variable 3 (high school class 
rank) has the highest correlation with English Composition Test score. Other 
variables that show a moderate relationship with English Composition Test scores 
obtained at the two administrations are variables 6, 10, 11, 12, 13, 14 and 15. 
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English Composition Test; Covarlate-Selectlon Analyses 

Results of the stepwise regression analyses carried out for the English 
Composition Test are summarized In Table 8. It Is Important to note that the 
analyses were carried out only for those examinees who had scores on all the 
variables Included in the model. Thus, sample size varies considerably from 
analysis to analysis. Table II, found in the appendix, contains 
intercorrelation matrices for the analysis groups. Sample sizes are presented 
in this table. Analyses were not interpreted for groups with sample sizes of 
fewer than 100 cases. 

Examination of the information presented in Table 8 indicates that, for the 
sophomore June analysis, the best two variable model is one that includes 
variables 1 and 2 (SAT-V and SAT-M scaled scores). This model accounts for 62% 
of the total variance. The best five variable model is one that includes 
variables 1 and 2 as well as variable 6 (amount of high school course work in 
foreign languages), variable 16 (number of honors or awards received during high 
school years), and variable 19 (father’s level of education). Use of all 5 
variables represents an increase in total variance accounted for of 
approximately 2.2%. 

The regression analyses conducted for high school juniors taking the English 
Composition Test at the five administrations show some similarity in that the 
best two variable model for all administrations, with the exception of the 
December administration, is one that involves SAT-V and SAT-M scores as 
covarlates. For the December data, the best two variable model includes 
variable I (SAT-V scaled scores) and variable 12 (grades received in a foreign 
language^. Using data obtained from seniors taking the English Composition Test 
in June, the best two variable model was found to be one that includes variable 
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1 and variable 10 (grades received In English.) The percentage of variance 
accounted for by the two variable models ranges from a low of 59% for the 
analysis carried out for seniors taking the test in December to a high of 66% 
for seniors taking the test in November. The percentage of variance accounted 
for by the best 5 variable models ranges from a low of 61%, again for December 
seniors, to a high of 67% for November seniors. 

Finally, the analyses carried out for the total groups taking the English 
Composition Test at the five administrations indicate that, for all groups 
except those examinees taking the test at the December administration, the best 
two variable model is one that uses scores on SAT-V and SAT-M. For the December 
group, the best two variable model appears to be one that uses scores on SAT-V 
and grades received in foreign languages. It is Interesting to note that grades 
received in foreign languages (variable 12) is included in the best three 
variable model for all administration groups with the exception of the January 
group. For this group, the best three variable model Includes SAT-V and SAT-M 
scores and variable 10 (grades received in English). The percentage of variance 
accounted for by the best five variable models ranges from a low of 61% for the 
model fit to the data from the December administration to a high of 67% for the 
model specified for the May group. 

Literature Test: Summary Statistics 

Summary statistics for the Literature Test by grade level and administration 
are found in Table 5. As was the case for the English Composition Test, it is 
important to note that total group Includes some candidates who did not specify 
grade level. 

Examination of the data presented in Table 5 Indicates that the Literature 
Test is not typically taken by high school freshmen, and that the number of 
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sophomores taking the test is substantial only for the June administration. The 
test appears to be taken predominantly by seniors at the November, December and 
January administrations and juniors at the June administration. The grade level 
composition of the groups taking the test at the May administration Is mostly 
juniors and seniors. 

An examination of the Literature Test scaled score means for all groups 
containing 25 or more examinees indicates several trends. For the November 
administration, juniors score higher than seniors; however. It should be noted 
that junior scores are based on a sample of only 33 examinees, probably very 
highly self selected and that these scores are quite variable. The data 
collected for the December administration again shows that juniors score higher 
than seniors and also that junior scores are considerably more variable than 
senior scores. The same pattern of scaled score means Is noted for the May and 
June administrations, l.e., juniors score higher on the Literature Test than 
seniors and their scores are more variable. The greatest discrepancies between 
junior and senior means (36 points) occur for the June administration. It Is 
Interesting to note that the highest scores of any grade level taking the test 
at the June administration are those obtained by sophomores. 

Various trends become clear when the achievement levels of the groups (as 
assessed by Literature Test scores) are evaluated across the five administra- 
tions. High school juniors, taking the test In December, are the most able 
junior group (as well as the most able of any group with 25 or more candidates) 
taking the test at any of the five administrations. The least able junior group 
Is the group of students who took the test In May. The remaining junior means 
can be arranged from high to low as follows: November, June, and January. 



O 




44 



- 31 



The Literature Test scaled score means obtained by those reporting their 
grade level as senior are much more similar across the five administrations than 
the junior means. The highest means (430 and 428) were obtained by the groups 
taking the test at the November and December administrations, respectively. The 
senior groups taking the test at the May and June administrations also received 
similar scaled score means (494 and 490, respectively) which were both lower 
than those received by the fall groups. The senior group taking the test at the 
January administration exhibited a scaled score mean between those obtained by 
the fall and spring groups. 

Table 6 presents SAT-V and SAT-M scaled score summary statistics for groups 
taking the Literature Test at the five administrations. Again, only summary 
statistics based on groups with 25 or more examinees are Included in the table. 
Also, as was noted for the English Composition Test, the total group Includes 
some students who did not specify grade level. 

A comparison of SAT-V and SAT-M scaled score means for juniors and seniors 
taking the test at the December Literature Test administration indicates that 
juniors score higher on both SAT-V and SAT-M than seniors. One interesting 
point to note is the similarity between the verbal and math mean scores obtained 
bv the juniors and, also, those obtained by the seniors. Typically, most 
Achievement Test groups exhibit a sizable difference in these scores, with SAT-M 
scores being higher than SAT-V scores. 

Examination of the data obtained for the January administration indicates 
that high school juniors scored higher than high school seniors on both SAT-V 
and SAT-M. This same trend is apparent for both the May and June groups. When 
examining the June data, it should be noted that the highest scoring group on 
both SAT-V and SAT-M were the sophomores (N*36) who took the test at this 
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The relationship between SAT-V and SAT-M scores (as assessed by the 
correlation coefficient) varies across administrations and grade levels. These 
correlations range from .49 for juniors taking the test in December to .66 for 
seniors taking the test in June. 

There are several trends In SAT-V and SAT--M scores across the five 
administrations that are worth noting. Generally, the most able group (as 
assessed by SAT-V and SAT-M scores) taking the test at all administrations, with 
the exception of June sophomores. Is the junior group. The junior and senior 
groups taking the test In November, December and June are the highest scoring 
groups that take the test. The lowest scoring groups, on both SAT-V and SAT-M, 
are the groups taking the test in May. It should be noted that, as was the case 
for the groups taking the English Composition Test, the SAT-V and SAT-M scaled 
score means vary considerably according to grade level and administration and 
are. In most Instances, considerably above a scaled score of 500. It should 
also be noted that the relationship between SAT-V and SAT-M scores varies with 
grade level and administration. In general, most of the correlations range 
between .56 and .64. 

One unusual point observed is the magnitude of SAT-V and -M scaled score 
means when paired for a particular group. As mentioned previously. In several 
Instances, these means are more similar (e.g., December juniors and seniors, 

June juniors) than typically observed for Achievement Test takers. 

Literature Test: Correlational Analyses 

Table 7 contains correlation coefficients for the Literature Test score with 
16 of the 17 experimental covarlates for the total group taking the Literature 
Test at the five administrations. As mentioned previously, it Is Important to 
note that the correlations were computed for all examinees taking the test at a 
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particular administration who had scores on both variables of interest; hence, 
sample size varies from one correlation coefficient to another. Also, as was 
the case for the English Composition Test, no Background Questionnaire is 
administered with the Literature Test, thus, no correlations of Literature Test 
scores with variable 17 appear in Table 7. 

Examination of the data given in Table 7 indicates that, for the November 
administration, correlations of Literature Test score with the 16 covariates 
range from a correlation of .84 with variable 1 (SAT-V scaled scores) to (not 
surprisingly) a correlati^^ of .00 with variable 7 (amount of high school course 
work in biology and sciences). The variable showing the second highest 
correlation with Literature Test score (.54) is variable 2 (SAT-M scaled 
scores) . Other variables that show a correlation with the Literature Test score 
of .30 or greater are variables 2, 3, 10, 13, and 15. (See Tables 1 and 2 for a 
description of these variables.) 

The data given in Table 7 for the December administration of the Literature 
Test show similar patterns in correlations to those observed for the November 
data. Again, variable I (SAT-V scaled scores) shows the highest correlation 
with Literature Test score (.82), variable 7 the lowest ^-.01), and variable 2 
(SAT-M scaled scores) the second highest correlation (.49). Other variables 
with correlations of .30 or greater with Literature Test scores obtained at the 
December administration are variables 2, 3 and 15. 

Data obtained at the January, May and June Literature Test administrations 
show similar patterns to those obtained at the two administrations .just 
discussed. In all cases, the variable that correlates most highly with 
Literature Test scores is SAT-V scaled scores, SAT-M scaled scores provide the 
second highest correlation and amount of high school course work in biology and 
sciences (variable 7) shows the lowest correlation. 
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Literature Test: Covarlate-Selectlon Analyses 

Results of the Literature Test regression analyses are summarized in Table 
8. As mentioned previously, it is important to note that the analyses were 
carried out only for those examinees who had scores on all the variables 
included in the model; thus, sample size varies considerably from one analysis 
to another. The reader is referred to Table II in the appendix for the inter- 
correlation matrices and sample sizes used for the analyses. Analyses were not 
interpreted for groups with sample sizes of less than 100 cases. 

The information presented in Table 8 for the Literature Test indicates that, 
for the junior May analysis, the best two variable model is one that contains 
variable 1 (SAT-V scaled scores) , and variable 4 (amount of course work in 
English). This model accounts for 72% of the total variance. The best five 
variable model is one that also Includes variable 2 (SAT-M scaled scores) , 
variable 6 (amount of course work in foreign languages) and variable 16 (number 
of honors or awards received.) Use of all five variables represents an 
Increase, from the two variable model, of 1% in the total variance accounted 
for. 

The analyses for the junior June sample produced slightly different results 
than those obtained using May data. The best two variable model, which accounts 
for 69% of the total variance, is one that Includes variables 1 and 10 (SAT-V 
scaled scores and grades in English). The best five variable model, which again 
represents an Increase of 1% in the total variance accounted for. Includes 
variables 1, 6, 8, 10, and 16. These variables are (in the order that they 
appear in the model) SAT-V scaled scores, course work in foreign languages, 
course work in physical sciences, grades in English, and number of honors or 
awards. 




48 



35 - 



The analyses conducted for high school seniors taking the Literature Test at 
the five administrations shows some similarity in that the best two variable 
model for the November, December, January and May administrations is one that 
includes variable 1 (SAT-V scaled scores) and variable 10 (grades in English). 

For June seniors, the best two variable model appears to be one that employs 
variable 1 (SAT-V scaled scores) and variable 3 (high school class rank). The 
percentage of variance accounted for by the two variable models ranges from 68% 
for the December and May seniors to 71% for seniors taking the test at the other 
three administrations. 

Increasing the number of variables in the model from two to five for the 
senior level data had little effect on the overall percentage of variance 
accounted for. The five variable models are not consistent across the different 
administrations, although they all have in common, variable 1 (SAT-V scaled 
scores) and variable 10 (grades in English) . 

The results of the total group analyses for the Literature Test are quite 
similar to those obtained for the senior data, with the exception that variables 
1 and 10 constitute the best two variable model for all administrations. The 
percentage of variance accounted for by this model ranges from 68% to 70% across 
the five administrations. As was the case for the senior level data, increasing 
the number of variables in the model from two to five does not Increase the 
percentage of variance accounted for by much. Again, the only variables that 
consistently enter into the five variable models, across the five administra- 
tions, are variable 1 (SAT-V scaled scores) and variable 10 (grades in English). 

American History and Social Studies Test? Summary Statistics 

Summary statistics by grade level and administration for the American 
History and Social Studies Test are found in Table 5. Perusal of the 
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information found in this table indicates that, as was the case for the English 
Composition Test and the Literature Test, the American History Test is taken 
predominantly by high school juniors and seniors, with seniors tending to take 
the test in the fall and juniors tending to take the test in the spring. It is 
interesting to note that a surprisingly large number of sophomores take the test 
at the June administration. The only freshmen group containing 25 or more 
candidates is the one taking the test at the June administration. 

Examination of the Information presented in Table 5 indicates that high 
school juniors score higher than seniors at all five administrations. High 
schoo] sophomores taking the test at the June administration score the same as 
the juniors who took the test at this administration. And, as was the case for 
the juniors, the sophomores score higher than the June seniors. It is worth 
noting that the small number of high school sophomores who took the test at the 
December administration (N-25) scored higher than either the juniors or seniors 
who took the test at the same administration. 

The achievement level of the groups (as measured by the American History and 
Social Studies Test) varies considerably across grade levels and administra- 
tions. The junior means can be ranked high to low in the following order: 

June, November, May, December and January. For the senior group, the means, 
ordered by administration (again high to low), are November, December, January, 
June and May. 

SAT-V and SAT-M summary statistics for students taking the American History 
and Social Studies Test at the five administrations are found in Table 6. 
Comparison of the junior and senior means for all five administrations indicates 
that high school juniors consistently score higher than high school seniors on 
both SAT-V and SAT-M. Sophomores taking the test in June score considerably 
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higher on both SAT-V and SAT-M than either the juniors or seniors who took the 
test at this administration. 

As expectec’, the relationship between SAT-V and SAT-M scores, as described 
bv the correlation coefficient, varies by grade level and administration, with a 
range from .42, for juniors taking the test in December, to ,66 for iunlors 
taking the test in January. 

Scaled score means for both SAT-V and SAT-M vary considerably across the 
various grade levels and administrations. SAT-V means range from a low of 480 
for seniors who took the test in May to a high of 557 for sophomores taking the 
test in June. Scaled score means on SAT-M range from a low of 504 obtained bv 
seniors taking the test in June to a high of 603, again for sophomores taking 
the test in June. In general, the scaled score means on both SAT-V and SAT-M 
for most grade levels and groups are considerably above 500. 

American History and Social Studies Test: Correlational Analyses 

Table 7 contains correlation coefficients for the American History and 
Social Studies Test scores with 16 of the 17 covarlates used in the study. The. 
correlations are for the total group taking the test at the five 
administrations. The reader is again referred to Table I in the appendix for 
complete correlation matrices and accompanying sample sizes. It should be noted 
that Table 7 does not contain correlations of American History Test scores with 
variable 17; this is because no Background Questionnaire is given with the 
American History and Social Studies Test. 

Examination of the data given in Table 7 indicates that, for the November 
administration, correlations of the 16 covarlates with the American History Test 
score range from a high of .76 for variable 1 (SAT-V scaled scores) to a low of 
♦01 for variable 7 (amount of course work in biology and sciences). Variable 2 
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(SAT-M scaled scores) has the second highest correlation with American History 
Test scores. Other variables that show a moderate relationship with the 
American History and Social Studies Test scores (a correlation of .30 or 
greater) are variables 3, 10, 13 and 15. These variables are, respectively, 
high school class rank, grades In English, grades In biology and sciences, and 
grades In social studies. 

Data obtained at the December administration exhibit similar patterns of 
correlation coefficients. American History Test scores correlate most highly 
with variable 1 (SAT-V scaled scores) and second highest with variable 2 (SAT-M 
scaled scores' . Variables 3, 10, and 15 all show correlations greater than .30 
with the American History and Social Studies Test scores. 

An examination of the correlation coefficients obtained for data from the. 
remaining three administrations shows the same relationship between American 
History and Social Studies Test scores and the 16 covariates. For each 
administration, the variable that correlates most highly with American History 
Achievement Test scores is variable 1 (SAT-V scaled scores), and the second most 
highly correlated variable is variable 2 (SAT-M scaled scores). It Is 
interesting to note that relationships with variables 6, 11, 12 and 14, which 
are, respectively, course work In foreign languages, grades In mathematics, 
grades in foreign languages, and grades In physical sciences, appear to be 
stronger for data collected at the two spring administrations of the test than 
for data collected at fall administrations. 



American History and Social Studies Test: Covarlate-Selectlon Analyses 

Results of the regression analyses carried out for the American History and 
Social Studies Test are summarized In Table 8. The reader Is referred to Table 
II of the appendix for Intercorrelation matrices of the variables used In the 
analyses. 




52 

BEST COPY AVAILABLE 



39 



The regression analyses conducted for the high school sophomores taking the 
test at the June administration indicate that the best two variable model is one 
that contains variable 1 (SAT-V scaled scores) and variable 19 (father's level 
of education). This model accounts for 60% of the total variance. The best 
five variable model includes variables 1, 4, 5, 15 and 19 which are 
respectively, SAT-V scaled scores, amount of course work in English, amount of 
course work in mathematics, grades in social studies, and father's level of 
education. The five variable model accounts for 65% of the total variance, an 
increase in 5% over that accounted for by the two variable model. 

The analyses carried out for high school juniors who took the test in May 
and June indicate that the best two variable model for both these groups is one 
that includes variable 1 (SAT-V scaled scores) and variable 15 (grades received 
in social studies). The two variable model accounts for 60% of the total 
variance for the May group and 61% of the total variance for the June group. 

The analyses carried out for the seniors indicate, across all five 
administrations, that the best two variable model is one that includes variables 
1 and 15 (SAT-V scaled scores and grades received in social studies, 
respectively). The percentage of variance accounted for by this model across 
the five administrations ranges from 56% for the January group to 64% for the 
June group. 

The results of the analyses carried out for the total group were quite 
similar to those obtained using senior data. The best two variable model for 
the total group, for all five administrations, is one that contains variable 1 
fSAT-V scaled scores) and variable 15 (grades in social studies). The 
percentage of variance accounted for by this model ranges from 56% for data 
obtained from the January administration to 63% for data obtained from the Mav 
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administration. The best three variable model is also consistent across the 
five administrations. This model Includes variables 1 and 15, but also variable 
2 fSAT-M scaled scores). Addition of SAT-M scaled scores to the model allows, 
in most cases, for an additional 1-2% of the total variance to be accounted for. 
The addition of fourth and fifth variables to the models did not show 
substantial increases in the percentage of variance accounted for fabove that 
accounted for by the three variable model) for many of the administrations. The 
three common variables (across the administrations) included in the five 
variable models were variables 1, 2 and 15 (SAT-V and -M scaled scores and 
grades in social studies). 

Mathematics Level I Test: Summary Statistics 

Achievement Test summary statistics for the Mathematics Level I Test are 
found in Table 5. Similar to tests that have been previously examined, it is 
apparent that the Mathematics Level 1 Test is taken predominately by high school 
juniors and seniors. Sophomores do constitute a reasonable number of the 
examinees who take the test at the May and June administrations and a sizeable 
number of freshmen take the test in June. 

An examination of the Math Level I Achievement Test scaled score means for 
the various groups (only those with sample sizes greater than 25 are Included in 
Table 5) indicates that sophomores and juniors score similarly on the November 
administration of the test, with high school seniors scoring considerably lower 
at this same administration. For the remaining administrations, sophomores 
appear to be the highest scoring group and seniors the lowest. Scores obtained 
by high school juniors are more or less Intermediate to those obtained by 
sophomores and seniors. Scores obtained by freshmen taking the test at the May 
administration are higher than those obtained by juniors. The opposite effect 
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observed in the data obtained for the June administration, i.e«, freshman 
leaking the test at this administration score lower than high school iuniors. 

... — -Uheu one considers the achievement levels of the groups (as assessed by mean 
scores on the Mathematics Level T Test) across the five administrations, the 
following trends become clear. First, the mean scores obtained by seniors rank 
consistently, high to low, from the November to the June administration. This 
consistency in ranking of mean scores is not apparent when one examines the 
scores obtained by high school juniors. Mean scores obtained by juniors are 
ranked high to low across the five administrations as follows: November, 

January, December, June, and May. 

Table 6 contains SAT-V and SAT-M summary statistics by grade level for 
candidates taking the Mathematics Level I Test at the five administrations. 
Examination of the data presented in Table 6 for the students taking the 
Mathematics Level T Test indicates that high school juniors score higher than 
high school seniors on both SAT-V and SAT-M for all five administrations. 
Considering only high school juniors, SAT-V scaled score means range from a low 
of 501, for the group taking the test in May, to a high of 534 for the December 
candidates. Scaled score means on SAT-M obtained by junior candidates range 
from 554 (for the May group) to 604, obtained by high school iuniors who take 
the test in November. Scaled score means on SAT-V for senior candidates range 
from 460 (candidates taking the test in January) to 507 for candidates taking 
the test in November. Senior scaled score means on SAT-M range from 522 for the 
May group to 564 for the November group. 

Tn general the correlations between SAT-V and SAT-M scores are higher for 
the iunior groups than for the seniors. These correlations range from .53 to 
.59 for jtinlors and from .50 to .55 for seniors. 



O^J 



- 42 - 



For the May administration, sample sizes are large enough to consider mean 
scores obtained by sophomores, as well as those obtained by juniors and seniors. 
It is interesting to note that, for this administration, sophomores score 
considerably higher on both SAT-V and SAT-M than high school juniors and 
seniors. In addition, the high correlation between SAT-V and SAT-M scores (,66) 
obtained by this group is noteworthy. 

Sample sizes for groups taking the Mathematics Level I Test at the June 
administration are large enough to consider mean scores obtained by all four 
grade levels. Sophomores obtained the highest scores on SAT-V for this admini- 
stration,* but high school freshmen obtained the highest scores on SAT-M, For 
both freshmen and sophomores, the correlations between SAT-V and SAT-M scores 
were quite low (,35 for freshmen and ,47 for sophomores). 

Mathematics Level I Test: Correlational Analyses 

Table 7 contains correlation coefficients for Achievement Test scores with 
17 covarlates for the total group taking the Mathematics Level I Test at the 
five administrations. It should be noted that a Background Questionnaire is 
administered with the Mathematics Level I Test; hence, correlations of test 
scores with variable 17 appear in Table 7, Examination of the data given in 
Table 7 indicates that, for the November administration, correlations of the 17 
covarlates with the Mathematics Level I Test scores range from -.03 for variable 
7 (amount of high school course work in biology and the sciences) to .84 for 
variable 2 (SAT-M scaled scores). Variables 1 and 11 (SAT-V scaled scores and 
grades in mathematics, respectively) correlate the same (,50), and second 
highest with Math Level I Achievement Test score. Other variables with moderate 
correlations (.30 or greater) are variables 3, 5, 8, 13, 14 and 17, (See Tables 
1 and 2 for a description of these variables,) 
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An examination of the data given in Table 7 for the December administration 
indicates a similar correlational pattern to the one observed for November. The 
correlations with Achievement Test score range from -.04 (variable 7 ) to .82 for 
the correlation with variable 2 (SAT-M scaled scores) • Other variables showing 
correlations with Achievement Test score of .30 or greater are variables 1, 3, 

5, 8, 11, 13, 14 and 17. In general, most of the correlations exhibited by the 
December data are slightly lower than those observed for data collected at the 
November administration. 

Results of the analyses carried out using data obtained at the January 
administration show similar trends to those obtained for the November and 
December analyses. SAT-M scaled scores (variable 2) and SAT-V scaled scores 
(variable 1), as well as grades in mathematics (variable 11), correlate most 
highly with Mathematics Level I scores. However, a number of other variables 
(variables 3, 5, 8, 13, 14, and 17) show correlations of .30 or greater with 
Achievement Test score. 

The correlation coefficients observed for the May and June data are similar 
to those observed for the other administrations. As expected, SAT-M scaled 
scores correlate most highly with scores obtained on the Mathematics Level I 
test. SAT-V scaled scores and grades in mathematics correlate the second most 
highly with Achievement Test scores. The correlation of Mathematics Level I 
scores with variable 17 (Achievement Test Background questionnaire responses) is 
lower for the May and June data than for data obtained at the fall 
adml nistrations . 

Mathematics Level I Test: Covariate-Selection Analyses 

Results of the regression analyses carried out for the Mathematics Level I 
Test are summarized in Table 8. As mentioned previously, no analyses for groups 
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containing fewer than 100 examinees are tabled. Examination of the Information 
presented In Table 8 Indicates that, for the sophomore June analysis, the best 
two variable model Is one that Includes variables 2 and 11 (SAT-M scaled scores 
and grades In mathematics). This model accounts for 63% of the total variance. 
The best five variable model Is one that Includes variables 2, 17, 8, 11 and 21. 
Employment of all five variables results In an Increase In the amount of 
variance accounted for of 2.2%. 

The regression analyses conducted for the high school juniors taking the 
test at the five administrations show some similarity across administrations. 

The best two variable model for most of the administrations Is one that Includes 
variables 2 and 11 (SAT-M scaled scores and grades In mathematics, 
respectively). The amount of variance accounted for by this model, used with 
data from the December, May and June administrations, ranges from 71% to 74%. 

The best two variable model for juniors taking the test at the November 
administration appears to be one that includes variable 2 fSAT-M scaled scores) 
and variable 17 (Achievement Test Background Questionnaire responses). This 
model accounts for 74% of the variance in the November data. For the January 
administration, the best two variable model for high school juniors taking the 
test is one that Includes variable 2 and variable 8 (amount of course work In 
physical sciences) . 

An examination of the regression analyses carried out for high school 
seniors taking the Mathematics Level I Test Indicates that. In most cases, the 
best two variable model Is one that Include variables 2 and 17 (SAT-M scaled 
scores and Achievement Test Background Questionnaire responses, respectively). 
The one exception to this pattern Is the model fit to data obtained at the June 
administration. For these data, the best two variable model Is one that 
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includes variable 2 and variable 11 (grades in mathematics). Tt is important to 
note that the best three variable model for high school seniors taking the test 
at all five administrations is one that includes variables 2 , 17 and 11. The 
total variance accounted for by this model, applied to data from the five 
administrations, ranges from 67% to 72%. 

The analyses carried out for the total group indicates that the best three 
variable model for the November - May administrations is one that includes 
variables 2, 17 and 11. For the total group taking the Mathematics Level T Test 
at the June administration , the best three variable model is one that includes 
variables 2, 11 and variable 1 (SAT-V scaled scores). The total variance 
accounted for by the three variable models applied to data from the five 
administrations ranges from 677 to 73%. 

Mathematics Level II Test; Summary Statistics 

Summarv statistics for the Mathematics Level II Test bv grade level and 
administration are found in Table 5. Examination of the data presented in Table 
S Indicates, as has been the case for the other tests discussed so far, that the 
Mathematics Level II Test is not typically taken by high school freshmen or 
sophomores. The only groups for these grade levels containing more than 25 
examinees are sophomores taking the test at the January and May administrations 
and freshmen and sophomores taking the test at the June administration. 

An examination of the Mathematics Level IT scaled score means summarized in 
Table 5 indicates several trends. For all five administrations, the highest 
scaled score means for the Mathematics Level II Test are those obtained bv the 
sophomores who took the test. For the June administration, the highest scaled 
score mean on the Achievement Test was obtained by high school freshmen and the 
second highest by sophomores. In general, the highest scores were obtained bv 
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the groups taking the test in November and the lowest by the groups who took the 
test in May. 

Table 6 presents SAT-V and SAT-M scaled score summary statistics for groups 
taking the Mathematics Level II Test at the five administrations. A comparison 
of SAT-V and SAT-M scaled score means obtained by high school iunlors and 
seniors indicates that juniors score higher than seniors on both tests at all 
five administrations. 

High school sophomores who took the Mathematics Level II Test at the May 
administration scored higher on SAT-M than either juniors or seniors. This same 
group scored lower on SAT-V than juniors but higher on this test than seniors. 

An examination of data obtained at the June administration of the Mathematics 
Level II Test indicates that high school freshmen taking the test at this 
administration scored higher on SAT-M than any other grade level . High school 
freshmen scores on SAT-V were lower than those obtained bv sophomores or 
juniors, but higher than those obtained by seniors taking the test at the June 
administration. 

In general, SAT-V scaled score means obtained by juniors taking the test 
across the five administrations ranged from 541, for the group taking the test 
at the November administration, to 576 for the group taking the test at the 
December administration. Scaled score means on SAT-M for high school juniors 
ranged from 649 for the group that took the test in May to 675 for the group 
that took the test in December. Senior SAT-V means ranged from a low of 483 for 
the group taking the test at the Mav administration to 557 for the group taking 
the test at the November administration. Senior scaled score means on SAT-M 
range from a low of 569 for the group that took the test at the May 
administration to a high of 656 for the group taking the test in November. 
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The relationship between SAT-V and SAT-M scores (as assessed by the 
correlation coefficients) varies across administrations and grade levels. These 
correlations range from .43 for luniors taking the test at the December 
administration to .60 for juniors taking the Mathematics Level TI Test in 
November. 

Mathematics Level II Test; Correlational Analyses 

Table 7 contains correlation coefficients for the Mathematics Level II Test 
score with 17 covarlates for the total group taking the Mathematics Level II 
Test at the five administrations. It should be noted that a Background 
Ouestlonnalre is administered with the Mathematics Level II Test; thus, 
correlations of Achievement Test score with variable 17 appear in Table 7. 
Examination of the data given in Table 7 indicates that, for the November 
administration, correlations of the Mathematics Level II Test scores with the 17 
covariates range from a correlation of .00 for variable 7 (amount of course work 
in biology and sciences) to a high of .79 for variable ? (SAT-M scaled scores). 
The variable correlating the second highest with Mathematics Level IT test score 
Is variable 1 (SAT-V scaled scores). Other variables that show a correlation 
with Math Level II Achievement Test score of .30 or greater are variables 3, 11, 

1 4 and 1 7 . 

The data given in Table 7 for the December administration of the Mathe- 
matics Level II Test show similar patterns in correlations to those observed for 
the November data. Again, variable 2 (SAT-M scaled scores) shows the highest 
correlation with Mathematics Level TI Test scores. Other variables with 
correlations of .30 or greater with the scores obtained at the December 
administration are variables 1, 3, 11, 14 and 17. 
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Data obtained at the January, May and June Mathematics Level II Test 
administrations show similar patterns to those obtained at the two 
admlnlst ralons just described. In all cases, the variable that correlates most 
highly with Mathematics Level II Test scores is SAT-M scaled scores. SAT-V 
scaled scores correlate the second highest and variable 11 (grades in 
mathematics) correlates the third highest. 

Mathematics Level II Test; Covariate-Selection Analyses 

Results of the Mathematics Level II regression analyses are summarized in 
Table 8. As has been the case for the regression analyses described so far, 
analyses are not tabled for groups with sample sizes of fewer than 100. 

The information presented in Table 8 for the Mathematics Level II Test 
indicates that for the sophomore June analysis, the best two variable model is 
one that includes variables 2 (SAT-M scaled scores) and variable 8 (amount of 
course work in mathematics.) The best five variable model for this group is one 
that Includes variables 2, 17, 6, 8, and 14. The total variance accounted for 
bv the five variable model is approximately 47%. 

The analvses carried out for the December and June junior groups indicate 
that the best two variable model for both groups is one that Includes variable 2 
(SAT-M scaled scores) and variable 11 (grades in mathematics). This model 
accounts for 68% of the total variance for the December group and 62% of the 
total variance for the June group. The best three variable model for the 
December and June groups is not consistent across the two administrations. The 
model specified for the December juniors is one that Includes variables 2, 17, 
and 11 (SAT-M scaled scores. Achievement Test Background Questionnaire 
responses, and grades In mathematics, respectively). The best three variable 
model specified for the June group is one that Includes variables 2 and 11, but 
also variable 1 (SAT-V scaled scores). 
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The analyses conducted for the high school seniors taking the Mathematics 
Level II Test at the five administrations show some similarity in that the best 
two variable model specified for all administrations* with the exception of 
June, is one that includes variables 2 and 17, (SAT-M scaled scores and 
Achievement Test Background Questionnaire responses). For the June 
administration, the best two variable model is one that includes variables ? and 
11 fSAT-M scaled scores and grades in mathematics). 

The best three variable models specified for seniors taking the test at the 
five administrations show some similarity in that, for all administrations, 
variables and 17 are included in the models. Differences occur in models that 
Include variable 1 (SAT-V scaled scores) or variable 11 (grades in mathematics). 

The results of the total group analvses for the Mathematics Level IT Test 
are quite similar to those obtained for the senior data. Tt is interesting to 
note that, for all administrations, the best four variable model is one that 
Includes variables 1 , ? , 11 and 17. The percentage of variance accounted for bv 
this model ranges from a low of 63% for the November and December administration 
to a high of 73% for the May administration. 




Biology Test: Summary Statistics 

Summary statistics bv grade level and administration for the Biology Test 
are found in Table 5. Perusal of the information found in this table indicates 
that, for the fall administrations (November, December, January), the test is 
taken predominately by high school iuniors and seniors, as has been the case for 
other Achievement Tests discussed so far. The situation is quite different for 
the spring administrations (May and June). A surprisingly large number of high 
school freshmen and sophomores take the test at both of these administrations, 
with freshmen and sophomores constituting the ma^or portion of test takers for 
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the .Tune administration. 



Examination of the information presented in Table 5 indicates that hijjh 
school sophomores that take the test at the fall administrations score higher 
than either iuniors or seniors. For the spring administrations, the trend is 
reversed and iuniors score higher than sophomores. High school freshmen score 
almost as high as iuniors who take the test in May and are the highest scoring 
group of any grade level to take the test in vTune. High school seniors are 
consistently the lowest scoring grade level group to take the test at any of the 
five administrations. The total group means for the spring administrations are 
considerably higher than the total group means obtained by the fall groups. 

This appears to be due mainly to the high scoring freshmen and sophomores who 
take the test in Mav and June. 

SAT-V and SAT-M summary statistics for students taking the Biology Test at 
the five administrations are found in Table 6. It is immediately apparent, if 
one compares the sample sl?.es for the freshmen and sophomores who take the test 
in June with the sample sizes reported for the same groups in Table 5, that only 
a few of the freshmen and sophomores taking the Blologv Test in June had 
available SAT scores. 

Comparison of the iunlor and senior means at all five admlnlst»rations 

•s consistently score higher than high school 



! obtained for iuniors taking the test in 
November to a low of 522 for those students taking the test in January. Jtinior 

SAT-M means range from a high of 651 for the November group to a low of 579, 

obtained by the Ma;^^ and June groups. High school senior SAT-V scaled score 

means range from 519 to 469, with the highest mean obtained bv the November 

group and the lowest by the June group. SAT-M scaled score means obtained bv 




SAT-V scaled score means for high school 



- 51 



high school seniors range from a high of 584 for the November group to a low of 
527 for the June group. 

The only freshmen and sophomore groups that contain 25 or more candidates 
are those students taking the test at the June administration. Both of these 
groups scored lower on SAT-V than the high school iunlors taking the test at 
this administration; however, freshmen and sophomore SAT-V scores were higher 
than those obtained by seniors. Students reporting their grade level as 
freshmen scored higher on SAT-M than any other grade level groun taking the 
Biologv Test in June. Those reporting their grade level as sophomores scored 
lower on SAT-M than freshmen and iuniors but higher than seniors taking the test 
at this administration. 

As expected, the relationship between SAT-V and SAT-M scaled scores (as 
described by the correlation coefficient between these scores) varies bv a grade 
level and administration, with a range from .43 for freshmen taking the test in 
June to .74 for iuniors taking the test In November. In general, the 
correlation between SAT-V and SAT-M scaled scores for the total group ranges 
between .56 and .63 across the five administrations. 

Biologv Test; Correlational Analyses 

Table 7 contains correlation coefficients for the Biology Test scores with 
16 of the 17 covariates used in the study. The correlations are for the total 
group taking, the test at the five administrations. The reader is referred to 
Table I of the appendix for complete correlation matrices and accompanving 
sample sizes. It should be noted that Table 7 does not contain correlations of 
Biology Test scores with variable 17; this is because no Background 
Questionnaire is given with the Biologv Test. 
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Examination of the data given in T^ble 7 indicates that> for the November 
administration » correlations of the 16 covariates with Biology Test scores range 
from a high of .75 for variable 1 fSAT-V scaled scores) to a low of .06 for 
variable 4 (amount of course work in English). Variable 2, fSAT-M scaled 
scores) has the second highest correlation with Biology Test scores. Other 
variables that show a moderate relationship with Biology Test scores (a 
correlation of .30 or greater) are variables 3, 10, 11, 13, 14 and 15. These 
variables are, respectively, high school class rank, grades in English, grades 
in mathematics, grades in biology and sciences, grades in physical sciences, and 
grades in social studies . 

Pesults of the analyses of data obtained at the December administration 
exhibit similar patterns of correlation coefficients. Biology Test scores 
correlate most highly with variable 1 fSAT-V scaled scores) and second highest 
with variable 2 (SAT-M scaled scores). Variables 3, 11, 13, 14 and 15 all show 
correlations of .30 or greater with Biology Test scores. 

An examination of the correlation coefficients obtained for data from the 
remaining three administrations show the same relationship between Biology Test 
scores and the 16 covariates. For each administration, the variable that 
correlates most highly with Achievement Test scores is variable 1 (SAT-V scaled 
scores) and the second most highly correlating variable is variable 2 (SAT-M 
scaled scores). 

Biology Test; Covariate-Selection Analyses 

Results of the regression analyses carried out for the Biology Test are 
summarized in Table 8. The reader is referred to Table II of the appendix for 
intercorrelation matrices of the variables used in the analyses. The regression 
analyses conducted for the high school freshmen taking the test in June indicate 
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that the best two variable model Is one that Includes variable 18 (highest level 
of education planned beyond high school) and variable 19 (father's level of 
education). This model accounts for approximately 40% of the total variance In 
Biology Test scores. The best five variable model for freshmen taking the test 
at the June administration accounts for approximately 56 % of the total variance 
and includes variables 3, 16, 18, 19 and 20 which are, respectively, high school 
class rank, number of honors or awards received during high school, highest 
level of education planned beyond high school, father's level of education and 
mother's level of education. 

The analyses carried out for high school sophomores taking the test In June 
Indicates that the best two variable model Is one that Includes variable 1 
(SAT-V scaled scores) and variable 2 (SAT-M scaled scores) . This model accounts 
for 57% of the total variance. The best three variable model for sophomores 
taking the test in June accounts for approximately 60% of the total variance and 
Is one that Includes variables 1 and 2 as well as variable 13 (grades In biology 
and the sciences) . 

The analyses carried out for high school jtinlors who took the test in 
December and June indicate that the best two variable model, for data collected 
at both these administrations. Is one that Includes variable 1 (SAT-V scaled 
scores) and variable 2 (SAT— M scaled scores). This model accounts for 
approximately 48% of the total variance for the December group and approximately 
55% of the total variance for the June group. The best three variable model for 
•juniors taking the test In December appears to be one that Includes variables 1 , 
? and 9 (amount of course work In social studies). This model accounts for 
approximately 52% of the total variance. The best three variable model for 
juniors taking the test at the June administration Is one that Includes 
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variables 1, 2, and 13 (grades in biology and sciences). This model accounts 
for approximately 57% of the total variance. 

The analyses carried out for the seniors indicate, across all five 
administrations, that the best two variable model is one that includes variable 
I and 2. The amount of variance in Blologv Test score acccounted for bv this 
model ranges from 55% for seniors taking the test in May to approximately 62% 
for the groups taking the test in November and June. The best three variable 
model is fairly consistent across the five administrations with the exception of 
the model specified for the November seniors. The model specified for this 
group Is one that includes variables 1,2 and 13. The best three variable model 
specified for the remaining four administrations Is one that specified variables 
1,2 and variable 7 (amount of course work in biology and the sciences). The 
percentage of variance accounted for by this model across the four 
administrations ranges from approximately 57% for data obtained at the May 

administration to 64% for data obtained for seniors taking the test at the June 

/ 

administration. 

The results of the analyses carried out for the total groups are fairly 
consistent across the five administrations. The best two variable model for all 
five administrations Is one that includes variable 1 (SAT-V scaled scores) and 
variable 2 (SAT-M scaled scores). The percentage of variance accounted for by 
this model ranges from 56% for data obtained at the June admlnlstrlon to 64% for 
data obtained at the May administration. 

The best three variable models specified for the total groups vary somewhat 
from administration to administration. All of the three variable models include 
variables 1 and 2. The third variable included in the model specified for the 
November, May and June administrations is variable 13 (grades in biology and 
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ficlences). The third vj’^'iable Included in the model specified for the December 
" ^ Tflnii^yy groups is Variable 7 (course work in biology and sciences^. 

--Chemistry Test: Summary Statistics 

Table 5 contains summary statistics bv grade level and administration for 
the Chemistry Test. As mentioned previously, it is important to note that the 
total group Includes some candidates who did not specify grade level. 

It is apparent, ^rom examination of the data present in Table 5, that the 
Chemistry Test is not typically taken by high school freshmen or sophomores at 
the November, December or January administrations. A reasonable number of 
sophomores appeared to have taken the test at the May administration. A similar 
number of freshmen took the test in June. Finally, similar to the pattern 
obtained for the Biology Test, high school sophomores consltltute a large 
proportion of the June test takers. 

An examination of the Achlevment Test scaled score means for the various 
groups indicates that, as has been observed for all the tests discussed so far, 
“^unlors score higher on the Chemistry Test than seniors at all five 
administrations. High school sophomores taking the test in May also score 
higher than seniors taking the test at this administration and just slightly 
lower than the May iunlors. The highest scaled score mean obtained by any group 
taking the test at the June administration was that obtained by students 
reporting their grade level as freshmen. High school sophomores scored the 
second highest on the test, followed by juniors and then by high school seniors. 

An examination of the total group means across the five administrations 
indicates that the most able group (as assessed bv Chemistry Test scores) is the 
group taking the test in June. The group with the second highest scaled score 
mean is the group taking the test in November* The lowest Chemistry scaled 
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score mean was obtained by the total group taking the test at the January 
administration. 

Table 6 contains SAT-V and SAT-M summary statistics by grade level for 
candidates taking the Chemistry Test at the five administrations. It should be 
noted, as was the case for the examinees taking the Biology Test, the number of 
freshmen and sophomores who took the test in May and June with appropriate SAT 
scores is small compared to the total number who actually took the Chemistry 
Test at these administrations. 

Considering only junior and senior scaled score means presented in Table 6, 
the same pattern that has been observed for the previous tests discussed is 
apparent, i.e., juniors score higher than seniors on both SAT-V and SAT-M for 
all five administrations. SAT-V scaled scores obtained by juniors range from a 
low of 528 to a high of 555. Those obtained by seniors range from a low of 476 
to a high of 545. SAT-M scaled score means obtained by juniors range from 606 
to 664 and those obtained by seniors range from 587 to 650. 

Sophomores who took the test at the Mav administration have a higher SAT-M 
mean than junior or senior candidates but a lower SAT-V mean than juniors taking 
the test at this administration. The highest SAT-M scaled score mean for the 
June administration was obtained by freshmen examinees and the second highest by 
sophomores. High school sophomores taking the test in June obtained the highest 
SAT-V mean, followed by freshmen, then juniors, and finally, seniors. 

An examination of the relationship between SAT-V and SAT-M scores (as 
assessed by the correlation coefficient) indicates that these correlations range 
from .41 for juniors taking the test in May to .68 for the small number of 
sophomores (N»25) also taking the test in Mav. 
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Chemistry Test; Correlational Analyses 

Table 7 contains correlation coefficients for Achievement Test Score vith 16 
of the 17 covariates for the total group taking the Chemistry Test at the five 
administrations* As mentioned previously, it should be noted that sample sizes 
varv from one pair of variables to another. Table I, found in the appendix, 
contains the correlation matrices summarized in Table 7. It should also be 
noted that no correlations of Chemistry Test scores with variable 17 
(Achievement Test Background Questionnaire responses) are given in Table 1 . 

This is because no Background Questionnaire is administered with the Chemlstrv 
Achievement Test. 

Examination of the data given In Table 7 indicates that, for the November 
administration, correlations of the 16 covariates with Chemistry Test Scores 
range from .63 for variable 2 (SAT— M scaled scores) to .02 for variable 4 
(amount of course work in English). Variable 1 (SAT-V scaled scores) has the 
second highest correlation with Chemistry Test scores. Other variables that 
show a correlation with the Chemistry Test scores of .30 or greater are 
variables 3, 8, 11, 13 and 14. These variables are, respectively, high school 
class rank, amount of course work in physical sciences, grades in mathematics, 
grades in biologv and sciences, and grades in physical sciences. 

An examination of the data given in Table 7 for the group taking the test at 
the December administration indicates a similar pattern to the one observed for 
the November correlations. The correlations with Chemistrv Achievement Test 
scores range from .68 with variable 1 (SAT-M scaled scores) to .02 with variable 
4 (amount of course work in English). Other variables correlating moderately 
with Chemistry Test scores are variables 1, 3, 11 ♦ 13 and 14. 
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Data obtained at the remaining three administrations show similar patterns 
to those observed for the November and December groups# In general, variable 2 
(SAT-M scaled scores) shows the highest correlation with Chemistry Test scores 
and variable 1 (SAT-V scaled scores), the second highest# Variables 3, 8, 11, 

13 and 14 show moderate correlations with Chemistry Test scores# 

Chemistry Test; Covarlate-Selectlon Analyses 

Results of the regression analyses carried out for the Chemistry Test are 
summarized In Table 8# As mentioned previously. It Is Important to note that 
the analyses were carried our for only those examinees who had scores on all the 
relevant variables# The render is referred to Table II, found in the appendix, 
for Intercorrelation matrices for all analysis groups# As has been the case for 
other tests, analyses are not presented for groups containing less than 100 
examinees# 

Examination of the information presented In Table 8 indicates that, for the 
sophomore June analysis, the best two variable model is one that Includes 
variable 1 (SAT-V scaled scores) and variable 2 (SAT-M scaled scores)# This 
model accounts for approximately 36% of the total variance# The best five 
variable model for sophomores taking the test In June appears to be one that 
Includes variables I, 2, 4, 8 and 11# (See Table 1 for a description of these 
variables#) The best five variable model fit to June sophomore data accounts 
^^or approximately 37% of the total variance# 

The best two variable model specified for juniors taking the Chemistry Test 
at the December and June administrations Is not consistent across the two 
administrations# For the December lunlor data, the best two variable model 
accounts for approximately 70% of the total variance and Includes variables 2 
(SAT-M scaled scores) and variable 10 (grades In English#) The best two 
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variable model for the June group includes variables 1 and 2 and accounts for 
approxitnately 73% of the total variance. 

The analvses carried out for the high school seniors taking the test 
Indicates that the best two variable model fit to data obtained at each of the 
five administrations is one that includes variables 1 and ?. The percentage of 
variance accounted for by this model ranges from 47% for the November group to 
5H% for the June group. 

4 

The best three variable model fit to the senior data is also consistent 
across the five test adm.lnlstrat ions . This model Includes variables 1 and 2, as 
well as variable 8 (amount of course work in physical sciences^. The percentage 
of variance accounted for by this model ranges from approximately 507 for data 
obtained from the November administration to 637 for seniors taking the test in 
June . 

The results of the analyses conducted for the total group were very similar 
to those obtained for the senior group, i.e., the best two variable model 
specified is consistent across the five administrations, and is one that 
contains variables 1 and 2. The best three variable model specified is 
consistent with the results obtained for the senior analyses with the exception 

the model fit to the June total group data. This model Includes variables 1, 
2, and 14 (grades received in phvsical sciences). 

Physics Test: Summary Statistics 

Summarv statistics for the Physics Test by grade level and administration 
are found in Table 5. Examination of the data presented in this table Indicates 
that the test is taken predominately bv seniors at the November, December and 
January administrations and by high school iunlors at the June administration. 
The test appears to be taken in Mav by similar numbers of juniors and seniors. 
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The only sizable freshmen group is the group taking the test at the vTune 
administration* This administration also has a fairly large number of 
sophomores taking the test. 

An examination of the Physics Test scaled score means for all groups 
containing 25 or more examinees indicates several trends. In general, as 
expected, high school iuniors score higher than high school seniors for all five 
administrations. The highest scoring group for the May administration contains 
those students reporting their grade level as sophomore. For the June adminis- 
tration, the highest scaled score means on the Physics Test were obtained bv the 
high school freshmen who take the test, the next highest by sophomores, followed 
by iuniors and then seniors. 

Perusal of the total group data summarized for the five administrations of 
the test indicates that the highest scaled score mean (616) was obtained by the 
total group taking the test at the November administration. It should be noted 
that this mean is very close to that obtained by the June total group (613). 

The lowest Physics Test mean obtained for a total group is 561, obtained by the 
Jantiary group. 

Table 6 presents SAT-V and SAT-M scaled score summary statistics for groups 
taking the Physics Test at the five administrations. Again, only summary 
statistics based on groups with 25 or more examinees are presented. For all 
five administrations, high school juniors score higher than high school seniors 
on both SAT-V and SAT-M. The range of SAT-V scaled score means for high school 
juniors is 505 to 562 and for high school seniors the range is 479 to 554. High 
sc^ ool junior SAT-M scaled score means range from 648 to 684. SAT-M means 
obtained by high school seniors range from 594 to 673. High school sophomores 
taking the test In June obtained higher scaled score means on both SAT-V and 
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SAT-M than either the juniors or the seniors taking the test at this 
administration . 

The relationship between SAT-V and SAT-M scaled scores varies across the 
different administrations and grade levels. The correlation between these two 
scores obtained by high school juniors ranges from .31 for the 34 candidates who 
took the Physics Test in December to .52 for the January and Mav groups. The 
correlation coefficient between SAT— V and SAT-M scores obtained by high school 
seniors ranges from a low of .44 for May candidates to a high of .50 for June 
cand idates . 

Physics Test ; Correlational Analyses 

Table 7 contains correlation coefficients for the Physics Test score with 17 
covarlates for the total group taking the test at the five administrations. 
Unlike the Biology and Chemistry Tests, a Background Questionnaire is adminis- 
tered with the Physics Test; hence, correlations of Physics Test score with 
variable 17 appear in Table 7. 

Examination of the data ftiven in Table 7 Indicates that » for the November 
administration, correlations of Physics Test scores with the 17 covariates range 
from a correlation of .62 with variable 2 (SAT-M scaled scores) to a correlation 
of .00 with variable 4 (amount of course work in English). The variable showing 
the second highest correlation with Physics Test scores (.50) is variable 1 
(SAT-V scaled scores). The variable that correlates the third highest with 
Physics Test scores (.30) is variable 8 (amount of course work in physical 
sciences) . 

The results of the analyses given in Table 7 for the December Physics Test 
administration show similar patterns in correlations to those observed for the 
November data. Again, variable 2 (SAT-M scaled scores^ correlates most highly 
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(.63) with Physics Test scores. Other variables showing moderate correlations 
with Physics Test scores are variables 1, 8, 14 and 17, which are respectivelv, 
SAT-V scaled scores, amount of course work in physical sciences, grades in 
physical sciences, and Achievement Test Background Questionnaire responses. 

Data obtained at the January, May, and June Physics Test administrations 
show similar patterns to those observed for the two administrations just 
discussed. In all cases, the variable that correlates most highly with Physics 
Test scores is variable 2 (SAT-M scaled scores) . Other variables that are 
somewhat consistent in showing correlations of .30 or greater with Physics Test 
scores are variables 1,8, 14, and 17. A few exceptions are worth noting; 1) 
variable 17 (Achievement Test Background Questionnaire responses) does not 
correlate as highly with Physics Test scores obtained at the May and June 
admlnl^.trations as it does with data obtained from the fall administrations; and 
2) variable 3 (high school class rank) and variable 11 (grades in mathematics) 
show moderate correlations with Physics Test scores for the groups taking the 
test at the spring administrations. 

Physics Test: Covariate-Selection Analyses 

Results of the Physics Test regression analyses are presented in Table 8. 

As mentioned previously, analyses are not presented for groups with sample sizes 
of fewer than 100 cases. The results of the analyses presented in Table 8 
Indicate that the best two variable model for juniors taking the test at both 
the Mav and June administrations is one that contains variables 1 and 2 (SAT-V 
and SAT-M scaled scores, respectively). This model accounts for approximately 
52% of the total variance for both the Mav and June analyses. The best three 
variable model is also consistent for juniors taking the test at the two 
administrations. This model is one that Includes variables 1 and 2, as well as 
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variable 8 (amount of course work In physical sciences.) This model accounts 
for approximately 54% of the variance in the May data and 53% of the variance 
for the June analysis. 

The analyses conducted for the high school seniors taking the Physics Test 
at the five administrations show a great deal of consistency across the five 
administrations. The best two variable model for all five administrations is 
one that includes variables 1 and 2. This model accounts for a range of 43% to 
52% of the total variance in Physics Test score. The best three variable model 
specified for high school seniors taking the Physics Test is also consistent 
across the five administrations. This model Includes variables 1 and 2, and 
also variable 17 (Achievement Test Background Questionnaire responses). The 
best three variable model accounts for a range of 47% to 53% of the total 
variance In Physics Test score. 

The results of the total group analyses for the Physics Test are quite 
similar to those obtained for the senior data. Variables 1 and 2 constitute the 
best two variable model for all administrations. The results indicating the 
best three variable model are not quite consistent with those obtained for the 
senior data. For the total group analyses, the best three variable model 
obtained for the fall administrations is one that includes variable 1 fSAT-V 
scaled scores), variable 2 (SAT-M scaled scores), and variable 17 (Achievement 
Test Background Questionnaire responses). The best three variable model for 
data obtained for the spring administrations is one that includes variables 1 
and 2, as well as variable 8 (amount of course work in physical sciences). 

French Test; Summary Statistics 

Table 5 contains summary statistics bv grade level and administration for 
the French Test. Examination of the information presented in Table 5 indicates 
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that the French Test Is taken predominately by high school juniors and seniors* 
High school seniors appear to take the test mostly at the November, December, 
and January administrations and juniors make up the largest proportion of test 
takers at the May and June administrations. A considerable number of high 
school sophomores and a small number of freshmen also take the test at the June 
administration. 

An examination of the Achievement Test scaled score means for the various 
grade level groups indicates that, for the November administration, high school 
seniors score sllghly higher than juniors; however, for the remaining four 
administrations, the situation Is reversed with juniors being the higher scoring 
group. High school sophomores (who took the test In May) score lower than 
juniors and higher than seniors who took the test at that same administration. 

For the June administration data, scaled score means are arranged high to low by 
grade level as follows: juniors, freshmen, sophomores, and finally, seniors. 

An examination of the total group means for the five administrations 
Indicates that these means range between a high of 555 for the group that took 
the test at the June administration to a low of 508 for the January group. The 
means obtained by the November, December, and May total groups are all fairly 
similar, ranging between 526 and 538. 

Table 6 contains SAT-V and SAT-M summary statistics by grade level for 
candidates taking the French Test at the five administrations. Scaled score 
means obtained by seniors across the five administration groups range from a low 
of 529, obtained by the May group, to a high of 558 (obtained at both the 
November and June administrations). SAT-M means for this group range from a low 
of 555, again for the May group, to a high of 593 obtained by the November 
group. Correlations between scores obtained on SAT-V and SAT-M by high school 




seniors range from .52 to *68* 
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SAT-V scaled score means obtained by juniors taking the French Test at the 
four administrations containing groups with 25 or more examinees range from a 
low of 5A7 for the June group to a high of 575 for the group taking the test in 
December. SAT-M scaled score means obtained bv this same group range from a low 
of 581 obtained at the June administration to a high of 613 obtained by those 
students taking the test In January. Th6 correlation between SAT-V and SAT-M 
scores obtained by high school juniors ranges between .58 and .62. 

It is Interesting to note that high school sophomores taking the French Test 
in June (the only group of sophomores containing 25 or more candidates) have 
higher scores on both SAT-V and SAT-M than juniors or seniors taking the test at 
this administration. The correlation between SAT-V and SAT-M scores obtained by 
sophomores is also quite high (.63). 

Fren ch Test: Correlational Analyses 

Table 7 contains correlation coefficients for French Achievement Test score 
with 17 covarlates for the total group taking the test at the five 
administrations. As mentioned previously, sample sizes varv from one pair of 
variables to another. The reader is referred to Table I. found in the appendix, 
for sample sizes for the correlations summarized in Table 7. It should be noted 
that a Background Questionnaire is administered with the French Test; thus, 
correlations with all 17 covarlates have been obtained for this test. 

Examination of the data given In Table 7 indicates that, for the November 
administration, correlations with French Test scores range from -.01 for 
variable 7 (amount of high school course work In biology and sciences) to .58 
for variable 1 (SAT-V scaled scores). Variable 17 (Achievement Test Background 
Questionnaire responses) correlates the second most hlghlv with French Test 

scores. 
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An examination of the correlational data given in Table 7 for the December 
administration indicates a similar pattern to the one observed for the November 
correlations. Scores on SAT-V correlate most highly with French Test scores and 
Achievement Test Background Questionnaire responses (variable 17) correlate 
second highest. The third most highly correlating variable is variable 2 (SAT-M 
scaled scores). Other variables showing correlations of .30 or greater are 
variables 6 and 12 (course work in foreign languages and grades in foreign 
languages, respectively). 

Data examined for the January, May, and June administrations show similar 
trends to those just described. In all cases, the variable correlating the most 
highly with French Test score is variable 1 (SAT-V scaled scores). Variables 
that consistently show correlations of .30 or greater with French Test scores 
are variables 2, 17, 6 and 12. Variable 3 (high school class rank) also shows a 
moderate correlation with French Test scores for the May and June groups. 

French Test: Covariate-Selection Analyses 

Results of the regression analyses carried out for the French Test are 
summarized in Table 8. Examination of the information presented in Table 8 
indicates that the best two variable model for juniors taking the French test in 
May and June is one that Includes variable 1 (SAT-V scaled scores) and variable 
17 (Achievement Test Background Questionnaire responses). This model accounts 
for 47% of the total variance for the May group and 41% of the total variance 
for the June group. A considerable increase in the percentage of variance 
accounted for by the models fit to the junior data can be accomplished by 
expanding the models from two to four variables. Examination of the information 
in Table 8 indicates that the best four variable model for the May and June 
administrations of the test to this grade level group is one that Includes 
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variables 1, 2, 17, and 12, which are SAT-V scaled scores, SAT-M scaled scores. 
Achievement Test Background Questionnaire responses, and grades in foreign 
languages, respectively. The four variable model accounts for 53% of the total 
variance found in the May data and 46% of the total variance in the June data. 

The regression analyses carried out for the high school seniors taking the 
test at the five administrations show some similarity in that the best two 
variable model is one that consistently employs variables 1 and 17 (SAT-V scaled 
scores and Achievement Test Background Questionnaire responses) . The best four 
variable model differs somewhat across the five administrations. This model 
includes variables 1, 2, and 17 for four of the five administrations and either 
variable 6 or 12 (course work in foreign languages and grades in foreign 
languages, respectively), as the remaining fourth variable. The best four 
variable models account for between 40% to 55% of the total variance for the 
analyses carried out across the five administrations. 

Finally, the analvses carried out for the total groups taking the French 
Test at the five administrations produced results similar to those obtained 
using the senior data. In all cases, the best four variable model is one that 
includes some combination of variables 1, 2, 17, 6 and 12. These models account 
for a range of 40% to 53% of the total variance in Achievement Test score across 
the five administrations. 

Spanish Test; Summary Statistics 

Achievement Test summary statistics for the Spanish Test are found in Table 
5. An examination of these data indicates that the Spanish Test (similar to the 
French Test) is taken predominately by high school juniors and seniors, although 
sophomores do constitute a fatrlv large proportion of the June test takers. 
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Similar to the other tests discussed so far, high school juniors obtain 
higher average scores than high school seniors at all five administrations. 
Sophomores, taking the test in May, score lower than juniors taking the test at 
the same administration, but higher than Mav seniors. The highest scaled score 
mean obtained by any group at the June administration was that obtained bv high 
school freshmen. The second highest mean for this administration was obtained 
by juniors, followed by sophomores, and finally, by seniors. 

An examininatlon of the total group means for the Spanish Test given at the 
five administrations indicates that these means range between a high of 535 for 
the June group and a low of 485 for students who took the test in January. 

Summary statistics for SAT-V and SAT-M scores obtained by examinees taking 
the Spanish Test at the five administrations are shown in Table 6. As has been 
the case for previous discussions, summary statistics based on fewer than 25 
candidates are not presented. 

Perusal of the information found in Table 6 Indicates that SAT-V means 
obtained bv senior candidates range from a low of 477 for the May administration 
group to a high of 515 obtained by both the June and November administration 
groups. SAT-M means for this group range from a low 528 for the May group to a 
high of 569 for the November group. The correlations between SAT-V and SAT-M 
scores obtained by high school seniors taking the Spanish Test range from .57 to 
. 68 . 

SAT-V scaled score means for juniors taking the Spanish Test range from a 
low of 514 (for those students taking the Spanish Test in June) to a high of 546 
for the December group. SAT-M means for those reporting their grade level as 
juniors ranged from a low of 563 for June juniors to a high of 600 for December 
juniors. Correlations between SAT-V and SAT-M scores obtained by juniors range 





between .58 and .68. 



- 69 - 



High school sophomores taking the Spanish Test in June received higher SAT-V 
and SAT-M scores than either the juniors or seniors taking the test at this 
administration. The correlation between the scores on these two variables 
obtained by June sophomores was .44. 

Spanish Test! Correlational Analyses 

Table 7 contains correlation coefficients for Spanish Test score with the 17 
proposed covariates for scores obtained bv the total group taking the test at 
the five administrations. It should be noted that, similar to the French Test, 
a Background Questionnaire Is administered with the Spanish Test; thus, 
correlations with all 17 covariates appear in Table 7. 

Review of the information presented in Table 3 Indicates that, for the 
November administration, the variable correlating most highly with Spanish Test 
score is variable 1 (SAT-V scaled scores). Variable 17 (Achievement Test 
Background Questionnaire responses) correlates second most highly and variable 2 
(SAT-M scaled scores) third most highly. Other variables showing correlations 
of .30 or greater are variables 6 and 12 (course work in foreign languages and 
grades in foreign languages, respectively). 

December data, presented in Table 7, show a similar pattern of correlation 
coefficients. In this case, the variable correlating most highly with Spanish 
Test scores is variable 17 (Achievement Test Background Questionnaire 
responses). The next highest correlating variable is variable 1, followed bv 
variable 2. 

Data for the remaining administrations displayed in Table 7 show similar 
patterns of correlation coefficients. Achievement Test Background Questionnaire 
responses (variable 17/ shows consistently high correlations with Spanish Test 
scores, as do variables 1 and 2* It should be noted that, for the January data. 




the only variable displaying a correlation greater than .30 is Achievement Test 
Background Questionnaire responses. 

Spanish Test; Covariate**Selection Analyses 

Results of the regression analyses carried out for the Spanish Test are 
summarized in Table 8. As mentioned previously, only those analyses conducted 
on groups with sample sizes of 100 or greater will be discussed. Examination of 
the Information presented In Table 8 for the high school juniors taking the test 
at the Mav and June administrations Indicates that the best two variable model 
specified for both these administrations is one that Includes variable 1 (SAT-V 
scaled scores) and variable 17 (Achievment Test Background Questionnaire 
responses) . The best four variable model Is also consistent across the two 
administrations. This model Includes variables 1, 2, 17, and 12, which are 
SAT-V scaled scores, SAT-M scaled scores, Achievement Test Background 
Questionnaire responses, and grades in foreign languages, respectively. This 
model accounts for 47% of the total variance in Achievement Test scores for the 
May administration and 43% of the total variance for the June administration. 

The best two variable model speclfed for the senior data is not consistent 
across the five administrations; however, all the models Include some 
combination of variables 1, 17, and 12. The best four variable model specified 
for seniors is also inconsistent across the five grade levels. The variables 
that do appear consistently are variable 1 (SAT-V scaled scores), variable 17 
(Achievement Test Background Questionnaire responses), and variable 12 (grades 
In foreign languages). The remaining fourth variable specified is either 
variable 2 (SAT-M scaled scores), variable 21 (parental Income), variable 5 
(course work In mathematics), or variable 19 (father’s level of education). 

This model accounts for a range of between 26% to 46% of the total variance In 



Spanish Test scores 
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The results of the analyses carried out for the total group taking the 
Spanish Test at the five administrations were slightly more consistent than 
those obtained using only senior data. The best four variable model specified 
for the total group taking the test at the November, May, and June 
administrations was one that contains variables 1, 2, 17, and 12. This model 
accounted for a range of 39% to 42% of the total variance in Spanish Test 
scores. The best four variable model for the total group taking the test in 
December and Januarv is one that Includes variables 1, 17, 12, and 21 (parental 
Income) . 

European History and World Cultures Test; Summary Statistics 

Summary statistics by grade level and administration for the European 
Historv and World Cultures Test can be found in Table 5. It should be noted 
that the European History and World Cultures Test was only offered at the 
December and May administrations at the time data were collected for this study. 
Beginning with testing year 1986-87, the test is offered only at the December 
and June administrations. Examination of the data presented in Table 5 
indicates that the test is taken predominately by high school seniors in 
December and iunlors in May. A considerable number of sophomores also take the 
test in May. 

It is fairly clear, from an examination of the information presented in 
Table 5, that high school iunlors are consistently the highest scoring group 
that take the test, scoring higher even than the sophomores who take the test in 
May. It is interesting to note that there is not a large difference between the 
achievement levels (as assessed by European History Test scores) of the May and 
December groups. The May group scores somewhat higher than the December group, 
and this is consistent for both iunlors and seniors. 
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Table 6 presents SAT-V and SAT-M sumnary statistics for grade level groups 
taking the European History Test at the two administrations. Again, no 
statistics based on groups containing fewer than 25 candidates are presented. 

As has been the case for achievement test groups discussed previously, iunlors 
score higher than seniors on both SAT-V and SAT-M. Tt is Intereslng to note 
that both juniors and seniors taking the test in December appear to be 
considerably more able (as assessed by SAT-V scores) than their May 
counterparts. However, SAT-M scores obtained by both juniors and seniors are 
quite slmlllar across the two administrations. The correlations between SAT-V 
and SAT--M scores obtained by juniors and seniors at the two administrations 
range from a low of .41 obtained by May seniors to a high of .57 obtained by May 
juniors. 

European History and World Cultures Test: Correlational Analyses 

Table 7 presents correlations of European History and World Cultures Test 
scores with 16 of the 17 proposed covarlates for the total group taking the test 
at the December and May administrations. Because no Background Questionnaire is 
administered with the European History Test, correlations of test scores with 
variable 17 are not presented in Table 7. 

An examination of the data given in Table 7 for both the December and May 
administrations Indicates similar patterns of correlations for data obtained at 
these two administrations. For both administrations, the variable correlating 
most highly with European History. Test score is variable 1 (SAT-V scaled 
scores). The variable correlating second most highly, for both administrations, 
is variable 2 (SAT-M scaled scores). The third most highly correlating variable 
(across the two administrations) is variable 15 (grades in social studies). 

Other variables that correlate moderately (correlations of .30 or greater) with 
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Achievement Test score are variables 10 and 13 (grades in English and grades In 

biology and sciences). 

I 

I 

European History and World Cultures Test; Covariate-Selection Analyses 

Table 8 contains the results of the regression analyses carried out for the 
European History and World Cultures Test. The results of the analyses conducted 
for the junior May group indicate that the best two variable model is one that 
includes variables 1 and 2 (SAT-V and SAT-M scaled scores) • This model accounts 
for 51% of the total variance in Achievement Test score. The best three 
variable model fit to the May junior data is one that Includes variables 1, 2, 
and ]5 (grades in social studies). This model accounts for an additional 3% of 
the total variance. 

The best two and three variable models f"*r seniors taking the test at the 
May and Decem.ber administrations is not consistent across the two 
administrations. The best two variable model for the December group is one that 
includes variables 1 and 2 , and this model accounts for 49% of the total 
variance. The best three variable model for seniors taking the test in December 
includes variables 1, 2, and 10 (grades in English) and accounts for an 
additional 1% of the total variance. For senior data obtained at the May 
administration, the best two variable model is one that Includes variables 1 and 
7 (amount of course work in biology and sciences). This model accounts for 
approximately 66% of the total variance. The best three variable model 
specified for Mav seniors (which accounts for an additional 1% of tfie total 
variance) is one that Includes variables 1, 7, and 9 (amount of course work in 
social studies). 
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The analyses carried out for the total group taking the test In December and 
May are somewhat consistent in that the best two variable model specified for 
both administrations Is one that Includes variables 1 and 2. The best three 
variable model specified for the December group (with 49% of the variance 
accounted for) is one that specifies variables 1, 2, and 10 (grades In English). 
For the May total group, the best three variables model Is one that Includes 
variables 1, 2, and 15 (grades In social studies). This model accounts for 
approximately 59% of the total variance in European History and World Cultures 
Test scores. 

German Test; Summary Statistics 

Summary statistics by grade level and administration for the German Test are 
found in Table 5. It should be noted that, as was the case for the European 
Mstory and World Cultures Test prior to testing year 1986-87, the German Test 
was given only at the December and May administrations. Examination of the 
information found in Table 5 indicates that the test is taken predominately by 
seniors in December. A fairly similar number of juniors and seniors take the 
test in May. This administration also includes a small number of sophomores. 

The most able group taking the test (as assessed by German Test score) appears 
to be high school iunlors. This is true for both the May and December 
administrations. Sophomores taking the test in May are more able than seniors, 
but not as able as juniors. In general, both groups (juniors and seniors) 
taking the test in December score considerably higher than their May 
counterparts. 

Table 6 contains summary statistics by grade level and administration for 
candidates taking the German Test. The only summary statistics based on 25 or 
more candidates are for seniors taking the test in December and May and juniors 
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taking the test in May. Scaled score means obtained for both SAT-V and SAT-M 
for seniors taking the German Test in December are considerably higher (562 and 
598, respectively) than for seniors taking the German Test in May (531 and 576, 
respectively). The correlation between SAT-V and SAT-M scores for the two 
administrations are .56 for the December administration and .54 for the May 
administration. The scaled score means for SAT-V and SAT-M obtained by the 
juniors taking the test in May are very similar to those obtained by the seniors 
taking the test in December (565 and 598) but considerably higher than those 
obtained by seniors taking the test in May. In general, the total group means 
on both SAT-V and SAT-M are higher for the December group than for the May 
group. 

German Test: Correlational Analyses 

Correlation coefficients for German Test score with 17 propoired covariates 
are shown in Table 7. It should be noted that a Background Questionnaire is 
administered with the German Test; hence, correlations of Achievement Test score 
with variable 17 appear in Table 7. 

An examination of the data given in Table 7 for both the December and May 
administrations indicates that the patterns of correlations of the covariates 
with German Test score are fairly similar for both administrations. The 
correlations with all of the covarlates are the lowest that have been discussed 
so far. The covariate correlating most highly with German Test scores obtained 
at the December administration Is variable 1 (SAT-V scaled scores); this is 
followed by variable 17 (Achievement Test Background Questionnaire responses). 
Variable 6 (course work in foreign languages) Is the only other variable with a 
correlation of .30 or greater with German Test score. It should be noted that 
variable ? (SAT-M scaled scores) has a correlation of .29 with German Test 
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For the May data, the most highly correlating variable Is variable 17 
(Achievement Test Background Questionnaire responses). This Is followed by 
variable 6 (course work In foreign languages) . Variables 1 and 2 (SAT-V and 
SAT— M scaled scores) show correlations of .29 and .27 with German Test scores. 

German Test; Covarlate Selection Analyses 

Results of the regression analyses carried out for the German Test are 
summarized In Table 8. Analyses employing the May junior data Indicate that the 
best two variable model Is one that contains variable 2 (SAT— M scaled scores) 
and variable 17 (Achievement Background Questionnaire responses). This model 
accounts for appoxlmately 26% of the total variance In German Test score. The 
best four variable model for this group (which accounts for approximately 31% of 
the total variance) Is one that Includes variables 2, 17, and 7 (amount of 
course work In biology and sciences) and variable 21 (parental Income). 

The best two variable model for seniors taking the test In December Is one 
that Includes variables 1 and 17, and for those taking the test In May, Is one 
that Includes variables 2 and 17. The best four variable model for December 
seniors (accounting for approxlmatley 30% of the total variance) Is one that 
Includes variables 1, 17, and 6 (amount of course work In foreign languages) and 
12 (grades In foreign languages). For the May seniors, the best four variable 
model (which accounts for 43% of the total variance) Is one that Includes 
variables 1, 2, 17, and 8 (amount of course work In physical sciences). 

The results obtained for the December total group are Identical to those 
obtained for the December seniors. Those obtained for the May total group are 
slightly different than the results obtained for the Mav seniors In that the 
best four variable model Is one that Includes variables 2, 17, 6, and 8. This 
model accounts for 30% of the total variance In German Test score for the 
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December administration and 37% of the total variance for scores obtained by the 
May total group. 

Hebrew Test: Summary Statistics 

Table 5 contains summary statistics by grade level and administration for 
the Hebrew Test. It should be noted that» similar to the German Test and the 
European History and World Cultures Test prior to testing year 1986-87, the 
Hebrew Test was offered only at the December and May Achievement Test 
administrations. Perusal of the information contained in Table 5 indicates that 
the maior administration of the test is the one carried out in December. 

Seniors constitute the largest proportion of this population. A small number of 
students also take the test in May and these students are predominately high 
school iunlors. It should be noted that iunlors score higher than seniors at 
both the December and May administrations; also, the scores obtained by the 
total groups for both administrations are fairly similar. 

Summary statistics for SAT— V and SAT— M scores obtained by Hebrew Test takers 
are found in Table 2. Very few of the summary statistics (those obtained by 
seniors in December and by the total group in December and May) are based on 
groups of 25 or more examinees. Both SAT-V and SAT-M total group means are 
higher for the group taking the test in December (568 and 583, respectively) 
than for the total group taking the test in May (513 and 553, respectively). 

The December total group SAT-V and SAT— M means are also higher than those 
obtained by seniors at that same administration (568 and 583, respectively). 
Correlations between SAT-V and SAT-M scores range from a low of .48 for the May 
total group to a high of .69 for the December total group. 
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Hebrew Test: Correlational Analysis 

Table 7 contains correlation coefficients for the Hebrew Test score with 16 
of the 17 covariates. Although a Background Questionnaire is administered with 
the Hebrew Test, it is currently undergoing extensive revision so it will be 
Ignored for these analyses. 

Perusal of the data obtained in Table 7 indicates that, for both the 
December and May administrations, the variable correlating most highly with 
Hebrew Test score is variable 1 (SAT-V scaled scores). The variable correlating 
next highest (for data from the December administration) is variable 13 (grades 
in biology and sciences). This is followed, in order, by variables 12, 2, and 3 
(grades in foreign languages, SAT-M scaled scores, and class rank, 
respectively) . 

The analyses conducted using data from the May administration indicate that 
variables ? and 5 (SAT-M scaled scores and amount of course work in mathematics) 
tie for the second most highly correlating variables. These variables are 
followed in order by variables 12, 13, 14, 11, 8, and 10 which are respectively, 
grades in foreign languages, grades in biology and sciences, grades in physical 
sciences, grades in mathematics, amount of course work in physical sciences, and 
grades in English. 

Hebrew Test: Covariate-Selection Analyses 

Results of the regression analyses conducted for the Hebrew Test are 
summarized In Table 8. The analyses carried out for December seniors (N*59) and 
the December total group (N*64) will be discussed In spite of the few number of 
cases they are based on. 

Perusal of the data presented in Table 8 Indicates that the best two 
variable model for the high school seniors taking the Hebrew Test in December is 
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one that includes variable 1 (SAT-V scaled scores) and variable 19 (father’s 
level of education) • This model accounts for approximately 35% of the total 
variance in Hebrew Test score. The best four variable model specified for high 
school seniors taking the test in December accounts for approximately 52% of the 
total variance and includes variable 2 (SAT-M scaled scores), variable 17 
(responses to the old version of the Achievement Test Background Questionnaire, 
variable 12 (grades in foreign languages), and variable 19 (father’s level of 
education) • 

The best two variable model for the total group taking the test in December 
is one that includes variables 2 and 19. The best four variable model for this 
group, which accounts for approximately 47% of the total variance in Hebrew Test 
scores is, as was the case for the senior analyses, one that Includes variables 

2, 17, 12, and 19. 

\ 

Latin Test: Summary Statistics 

Achievement Test summary statistics by grade level and administration for 
the Latin Achievement Test are found in Table 5. Tt should be noted (as is was 
for the German, Hebrew and European History and World Cultures Test) that prior 
to testing year 1986-87, the Latin Test was offered at only the December and May 
Achievement Test administrations. 

Examination of the data found in Table 5 for the Latin Test indicates that 
the test is taken mostly by seniors in December and by high school juniors in 
May; however, a fairly large number of sophomores, and a sufficient number of 
freshmen (for the purposes of this discussion), took the test in May. A closer 
look at the scaled score means presented in Table 5 indicates that juniors 
taking the test in December obtain an unusually high scaled score mean (630), 
when compared to that obtained by December seniors (550) or May juniors or 
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seniors (573 and 524, respectively). High school sophomores taking the test in 
May score higher on average (588) than either the juniors or seniors taking the 
test. High school freshmen, taking the test In May, achieve a higher mean (567) 
than the seniors taking the test, but a lower mean than either the sophomores or 
juniors taking the test. The total group mean obtained for the May group Is 
higher than that obtained In December, reflecting the highly able freshmen and 
sophomores who take the test at this administration. 

SAT-V and SAT-M summary statistics obtained by students taking the Latin 
Achievement Test are found In Table 6. As mentioned previously, only those 
statistics based on 25 or more candidates are presented. Scaled score means on 
both SAT-V and SAT-M obtained by seniors taking the test In December are quite 
similar to those obtained by the group taking the test In May. For SAT-V, the 
means are 563 and 561, respectively, and for SAT-M they are, respectively, 594 
and 593. Correlations between SAT-V and SAT-M scores obtained by this group are 
quite high for both administrations, (.55 for the December senior data and .63 
for the May senior scores) . 

SAT-V and SAT-M scaled scores obtained by the juniors taking the test In 
June are considerably higher than those obtained by the high school seniors. 

The total group mean for SAT-V and SAT-M scores are higher for the total group 
taking the test In May than for the total group taking the test in December. 

Latin Test! Correlational Analyses 

Correlations of the Latin Test score with 17 covarlates are summarized In 
Table 7. An Achievement Test Background Questionnaire Is administered with the 
Latin Test; hence, correlations of Achievement Test scores with variable 17 
appear In Table 7. 
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An examination of the information presented in Table 7 indicates that the 
patterns of correlation coefficients of the Latin Test score with the 17 
covarlates are fairly similar for the December and May administrations. The 
variable correlating most highly with Latin Achievement Test scores, for both 
administrations, is variable 1 (SAT-V scaled scores). Variable 2 (SAT-M scaled 
scores) is the variable that correlates the second most highly. The correlation 
of Achievement Test Background Questionnaire responses (variable 17) with Latin 
Test scores is dissappointly low for both the December and May data (.29 and 
•26, respectively). Other variables showing correlations of .30 or greater 
(examining both the December or May data) are variables 3, 6, 11, 12, 13, and 
14. These variables are, respectively, class rank, amount of course work in 
foreign languages, grades in mathematics, grades in foreign languages, grades in 
biology and sciences, and grades in physical sciences. 

% 

Latin Test; Covarlate-Selectlon Analyses 

Results of the regression analyses carried out for the Latin Test are 
suiranarlfied In Table 8. The data presented In Table 8 Indicate tliat the best two 
variable model for May iunlor candidates Is one that Includes variable 1 (SAT-V 
scaled scores) and variable 12 (grades In foreign languages). This model 
accounts for approximately 30% of the total variance In Latin Test score. The 
best four variable model for this group Is one that Includes variables 1, 2, 17, 
and !2 (SAT-V scaled scores, SAT-M scaled scores. Achievement Test Background 
Questionnaire responses, and grades In foreign languages, respectively). 
Approximately 36% of the total variance In Latin Test score Is accounted for by 
this model. 

The best two and four variable models fit to the senior data are not 
consistent across the two administrations. For the December administration, the 
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best two variable model Is specified to be one that Includes variables 1 and 2. 
The best two variable model specified for the May data is one that Includes 
variables 2 and 17. 

The best four variable models fit to the senior data collected at the two 
administrations Is one that Includes variables 1, 2, and 17 (SAT-V scaled 
scores, SAT-M scaled scores, and responses to the Achievement Test Background 
Questionnaire, respectively). The fourth variable Included for the December 
data is variable 12 (grades In foreign languages). The fourth variable Included 
for the May data Is variable 3 (class rank). The percentage of variance 
accounted for by these two models Is approximately 42% for the data collected at 
the December administration and 54% for the May data. 

The results of the total group analyses are consistent across the December 
and May administrations. The best two variable model specified for both 
administrations Is one that contains variables 1 and 2 (SAT-V and SAT-M scaled 
scores). The best four variable model Is one that contains variables 1, 2, 17, 
and 12 (SAT-V scaled scoret, , SAT-M scaled scores. Achievement Test Background 
Questionnaire responses, and grades In foreign languages). This model accounts 
for approximately 427 of the total variance In the December data and 43% of the 
total variance In the May data. 

Across Test Comparisons; Summary Statistics 

Data comparing Achievement Test summary statistics are found In Table 5 and 
Figure 1. Figure 1 contains plots of total group Achievement Test scaled score 
means for the 10 Achievement Tests that are given at all five administrations. 
No plots of Achievement Test means were made for the four tests that are 
administered only twice a year. The data given In Table 5 will be used to make 
comparisons for these tests. 
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Several observations can be made based on an examination of the data 
presented in Table 5. First, it is generally true, for the 10 tests given at 
all five administrations, that the majority of students taking the tests at the 
November, December, and January administrations are high school seniors and 
those taking the tests at the June administration are high school juniors. The 
one exception to this rule Is the Biology Test, which is taken predominately In 
June by sophomores. It should be noted that the June Chemistry Test Is also 
taken by a large number of sophomores; however, juniors still make up the 
maiorltv of the June test taking population. 

Considering only those tests administered at all five administrations, the 
grade level composition of the May administration Is not as consistent across 
tests as the grade level composition observed for the other four 
administrations. For most of the tests, the majority of May test takers are 
high school juniors. However, for some tests, almost equal numbers of juniors 
and seniors participate In the May administration. These tests are Literature, 
Mathematics Level IT, and Physics. 

An Interesting point to note, when examining the information presented In 
Table 5 for the tests administered five times a year. Is that the largest volume 
administration for most tests Is the December administration. However, the 
science tests (Biology, Chemistry and Physics) and the American History and 
Social Studies Test are administered to more candidates In June than In 
December. This appears mainly to be due to the large number of sophomores that 
choose to take these tests In June. 

The Information presented In Table 5 for the Achievement Tests administered 
only twice a year exhibits similar trends to those observed for the tests 
administered at all five administrations. Most of the tests administered only 
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twice a year are taken predominately by seniors at the December administration 
and by juniors at the May administration. It should be noted that high school 
seniors who took the German Test in May make up a larger proportion of the 
population than do May seniors taking the remaining tests that are administered 
only twice annually. 

Any comparison of the achievement levels of the various groups (as assessed 
by scaled scores on the Achievement Tests’) must be made with caution. Scores 
obtained for the different tests may not necessarily be on the same scale and 
should only be considered roughly comparable. Comparisons within a test, across 
administrations, can be made with confidence since scores on different forms of 
the same test which are given at the various administrations have been equated. 

An examination of the total group means for the 10 tests given five times 
annually, which are plotted in Figure 1, indicates that, in general, the most 
able groups that take the tests are those students who take the tests in June 
and the least able groups are those who take the tests in January. Reference to 
Table 5 Indicates that the Achievement Tests that are given only twice a year 
show a similar pattern, l.e., total group means obtained by groups taking the 
tests in May are higher than those obtained by examinees taking the tests in 
December. 

Inser: Figure 1 about here 

Considering only the scaled score means obtained on the tests given five 
times a year, the following observations can be made. A comparison of the 
scores obtained at the two largest administrations (December and June) shows 
that the mean scores on the tests rank order high to low in a somewhat similar 
manner. This ranking is given below: 
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December Administration June Administration 



Test 


Ach. Test 
Mean 


Test 


Ach. Test 
Mean 


Mathematics Level II 


646 


Mathematics Level II 


677 


Physics 


578 


Physics 


613 


Chemistry 


556 


Chemistry 


585 


Mathematics Level I 


532 


Biology 


562 


Literature 


528 


French 


555 


French 


526 


American History 


555 


English Composition 


522 


Mathematics Level T 


553 


Biology 


517 


Spanish 


535 


American History 


505 


English Composition 


528 


Span! sh 


497 


Literature 


526 



Although one must take Into account the fact that scaled score means 
obtained on the different tests are only roughly comparable, it seems reasonable 
to conclude that the most able groups taking the tests at the December and June 
administrations are examinees taking the Mathematics Level II, Physics, and 
Chemistry Tests, with the Mathematics Level II group considerably more able than 
examinees taking the other two tests. It also seems reasonable to conclude 
that, general Iv, those examinees taking the English Composition Test, American 
History and Social Studies Test, and the Spanish Test are among the least able. 

It is interesting to add to this comparison, a comparison of the scaled 
score means obtained on the tests that are administered only twice a year. For 
convenience, these are listed below In order from high to low. 



December Administration 


Test 


Ach. Test 
Mean 


Hebrew 


625 


Latin 


555 


European History 


542 


German 


542 



Mav Administration 



Test 


Ach. Test 
Mean 


Hebrew 


632 


Latin 


570 


European History 


557 


German 


542 
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It is clear that the groups taking the Hebrew Test are the most able (as 
assessed by the Hebrew Test scaled score means). It is also fairlv clear that 
the groups taking the European History and World Cultures Test and the groups 
taking the German Test are less able than examinees taking the Latin Test. 

SAT-V and SAT--M scaled score summary statistics for candidates taking the 
Achievement Tests are found in Table 6. Figure 7 contains plots of SAT-V scaled 
score means for the total group taking the 10 Achievement Tests that are given 
fi'^e times annually. Figure 3 displays the same data for SAT-M scaled score 
means. As was the case for the Achievement Test scaled score means previously 
discussed, plots were not obtained for the small volume Achievement Tests that 
are given only twice a year. 



Insert Figures 2 and 3 about here 

An examination of the information given in Table 6 and Figure 2 indicates 
that, in general, higher SAT-V scaled score means are obtained by the total 
groups taking the tests in June and November than those taking the tests in 
January. In manv instances, the November and June SAT-V means for the total 
group taking a particular test are quite similar. Most of the SAT-V scaled 
score means obtained by the May groups are similar to, but somewhat higher than, 
those obtained by the January groups. For most of the tests, the December SAT-V 
scaled score means obtained by the total groups are lower than those obtained by 
the November and June groups and higher than those obtained by the January and 
Mav groups. Given below is a rank ordering of the 10 tests (from high to low) 
based on SAT-V scaled score means for the two largest volume administrations 
(December and June). 
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December Administration 


^ June Administration 




SAT-V 




SAT-V 


Tesf 


Mean 


Test 


Mean 


Mathematics Level II 


548 


Mathematics Level II 


562 


French 


546 


Physics 


556 


Physics 


535 


French 


548 


Chemistry 


529 


Literature 


532 


Literature 


528 


American History 


532 


English Composition 


518 


Chemistry 


528 


Biology 


513 


Biology 


526 


Spanish 


512 


English Composition 


521 


American History 


507 


Spanish 


514 


Mathematics Level I 


499 


Mathematics Level I 


506 



Several observations are clear from the orderings given above. First, the 
groups taking the Mathematics Level IT Test are the most able (as assessed by 
SAT-V scores). However, the discrepancy between SAT-V scores obtained bv this 
group and the next highest scoring group are not nearly as extreme as that 
observed when the tests were rank ordered by Achievement Test means. Second, it 
appears that the groups taking the English Composition Test, Spanish Test, and 
Mathematics Level I Test are the least verbally able; although once again, the 
large score discrepancies observed between mean Achievement Test scores are not 
apparent here. 

A review of the SAT-V means presented in Table 2 for the total groups taking 
the tests that are administered only twice a year indicates that, in general, 
higher SAT-V means are obtained by the groups taking the tests in December than 
by those taking the tests in May. The total group SAT-V means for tests 
administered twice annually rank order high to low as follows: 



December Administration May Administration 



Test 


SAT-V 

Mean 


Test 


SAT-V 

Mean 


Hebrew 


571 


Latin 


578 


European History 


564 


German 


548 


German 


563 


European History 


542 


Latin 


563 


Hebrew 


513 
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Tt is interesting to note the similarity among SAT-V means obtained by the 
European History and World Cultures, German, and Latin Test examinees who took 
the Achievement Tests at the December administration; also among the SAT-V means 
obtained by the total groups taking the German and European History and World 
Cultures Tests in May. The reverse in the ranking of the Hebrew Test means 
across the two administrations is most likely due to the instability in the data 
due to small sample sizes. 

Perusal of the information given in Table 6 and Figure 3 indicates that, 
similar to the SAT-V means previously discussed, higher SAT-M scaled score means 
are obtained by the total groups taking the tests at the November and June 
administrations. The lowest scaled score means on SAT-M are typically obtained 
by either the May or January total groups. However, for the language tests 
given five times annually (Spanish and French), SAT-M scaled score means 
obtained by the total group who took the test at the May administration are 
higher than those obtained in either December or January. 

The table given below rank orders (from high to low, using total group SAT-M 
scaled score means) Achievement Test groups who took the tests at the two 
largest volume administrations (December and June) . 



December Administration June Administration 



Test 


SAT-M 

Mean 


Test 


SAT-M 

Mean 


Physics 


644 


Mathematics Level II 


652 


Mathematics Level II 


640 


Physics 


650 


Chemistry 


624 


Chemistry 


606 


French 


570 


French 


582 


English Comoosltlon 


566 


Biology 


575 


Biology 


559 


English Composition 


572 


Spanish 


552 


Spanish 


563 


Mathematics Level I 


548 


American History 


562 


American History 


539 


Mathematics Level T 


562 


Literature 


531 


Literature 


535 
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The rank ordering of the groups across the two administrations Is quite 
similar. The total groups taking the Physics, Mathematics Level II, Chemistry, 
and French Tests appear to be the most quantitatively able, and the groups 
taking the Mathematics Level I, American History and Social Studies, and 
T^lterature Tests the le^t quantitatively able. It Is interesting to note that, 
when the rank ordering obtained above is compared to that obtained using scaled 
score means on SAT-V, fairly similar conclusions can be reached, l.e., the most 
able groups taking the Achievement Tests are those that take the Mathematics 
Level II, French, Physics, and Chemistry Tests. The least able are the groups 
taking the Mathematics Level I and American History and Social Studies Tests. 

The only group whose ranking changes considerably, depending upon whether scores 
on SAT-V or SAT-M are used to rank groups, is the total group taking the 
Literature Test. This group is, as expected, ranked much higher if SAT-V scaled 
scores are used for the ranking than If SAT-M scaled scores are used. 

A review of the SAT-M scaled score means presented in Table 6 for the total 
groups taking the tests that are administered only twice a year indicates that, 
in general, higher SAT-M means are obtained by the total groups taking the tests 
in December than by those taking the tests in May. The total group SAT-M scaled 
score means for the groups taking the tests in December and May rank order high 
to low as follows: 



December Administration 


May Administration 




SAT-M 




SAT-M 


Test 


Mean 


Test 


Mean 


German 


598 


Latin 


618 


Latin 


594 


German 


588 


Hebrew 


584 


European History 


565 


European History 


559 


Hebrew 


553 
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Several points are worth noting when examining the information given in the 
table above* First, scores on the German, Latin, and to some extent, Hebrew 
Tests are similar for the groups taking the tests at the December 
administration. However, scores obtained by the total groups taking the tests 
at the May administrations are quite diverse, ranging from a high of 618 for 
Latin Test takers to a low of 553 for the group taking the Hebrew Test. Some 
similarities in SAT-M mean scores to mean scores collected for SAT-V are worth 
noting; for example, the group taking the Latin Test in May is very able both 
verbally and quantitatively. Also, SAT-M scaled score means obtained by Hebrew 
Test takers are somewhat erratic across the administrations and probably reflect 
instabilities in the data due to small sample sizes. 

Across Test Comparisons: Correlational Analyses 

Information summarizing the correlational analyses carried out for the study 
are found in Table 7 and Figures 4-8. Figure 4 contains plots showing the 
magnitude of the correlation coefficients of Achievement Test scores with the 
following covariates: SAT-V scaled scores, SAT-M scaled scores, course grades, 

Achievement Test Background Questionnaire responses, and amount of course work 
in a particular subject area. These particular variables were chosen because 
they contributed most Importantly to the regression models that will be 
discussed in the next section. Information pertaining to tests administered 
both five times annually and twice annually are Included in Figure 4. Two 
letter symbols have been used in Figure 4 to identify the various Achievement 
Tests. It should be noted that, since the Achievement Test Background 
Questionnaire is not administered with all Achievement Tests, only a limited 
number of points have been plotted for this variable. It should be further 
noted that Figure 4 contains data pertaining only to the November Achievement 
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Test administration. Figures 5-8 contain the same information that is displayed 
in Figure 4 for the remaining four Achievement Test administrations. 



Insert Figure 4 about here 



One final point needs to be made as introduction to the discussion of the 
correlational analyses. Two variables that are included in Figures 4-8 vary 
slightly depending upon which Achievement Test is being considered. These two 
variables are course grades and amount of course work In a particular subiect 
area. The table given below lists the particular variables used for the 
correlation coefficients summarized in the figures. The two letter 
abbreviations given in parentheses are the same as those used in Figures 4-8. 

Amount of Course Work 



Test 


Symbol 




Course Grades 


in a Particular Subject Area 


Variable 

Number 


Variable Name 


Variable 

Number 


Variable Name 


Eng . Comp . 


(EN) 


12 


grades in For. Languages 


4 


c.w. in English 


Literatu re 


(LR) 


10 


grades in English 


4 


c.w. in English 


Amer. Hist. 


(AH) 


15 


grades in Social Studies 


9 


c.w. in Social Studies 


Math I 


(Ml) 


11 


grades in Mathematics 


5 


c.w. in Mathematics 


Math II 


(M2) 


11 


grades in Mathematics 


5 


c.w. in Mathematics 


Biology 


(BY) 


13 


grades in Biology 


7 


c.w. in Biology 


Chemistry 


(CH) 


14 


grades in Phvs. Sciences 


8 


c.vT. in Phys. Sciences 


Physics 


(PH) 


14 


grades in Phys . Sciences 


8 


c.w. in Phys. Sciences 


French 


(FR) 


12 


grades in For. Languages 


6 


c.w. in For. Languages 


Spanish 


(SP) 


1? 


grades in For. Languages 


6 


c.w. in For. Languages 


Eur. Hist. 


(EH) 


15 


grades in Social Studies 


9 


c.w. in Social Studies 


German 


(GM) 


12 


grades in For. Languages 


6 


c.w. in For. Languages 


Hebrew 


(HB) 


12 


grades in For. Languages 


6 


c.w. in For. Languages 


Latin 


(LT) 


12 


grades in For. Languages 


6 


c.w. in For. Languages 




Examination of the 


information provided in Table 


7 and Figure 4 for the data 



collected at the November Achievement Test administration indicates that the 
correlations of SAT-V scaled scores with Achievement Test scores range from a 
low of .44 for the Mathematics Level IT Test to a high of .84 for the Literature 
Test. Achievement Tests that have correlations of .70 or greater with this 
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variable are: English Composition, Literature, American History and Social 

Studies and Biology. 

Correlations of Achievement Test scores (for the November administration) 
with SAT-M scaled scores range from a low of .33 for the Spanish Test to a high 
of .84 for the Mathematics Level I Test. Only two Achievement Tests have scores 
that correlate above .70 with this variable; these tests are, as expected. 
Mathematics Level I and Mathematics Level II. 

The third variable to be considered Is Achievement Test Background 
Questionnaire responses. As mentioned previously, only five tests given at the 
November administration x^ere administered with Background Quef^tlonnalres. 
Correlations with Background Questionnaire responses range from a low of .29 for 
the Physics Test to a high of .49 for the French and Spanish Tests. 

Perusal of the Information found In Table 7 and Figure 4 shows that 
correlations of Achievement Test scores with course grades range from a high of 
.50 for the Mathematics Level I Test to a low of .29 for the English Composition 
Test and the Physics Test. In general, the correlations of Achievement Test 
scores with course grades are considerably lower than the correlations with 
Achievement Test scores observed for the first three variables discussed. 

An examination of the correlations of amount of course work In a particular 
subject area with Achievement Test scores obt^ned at the November 
administration Indicates that these correlations range from .09 for the English 
Composition Test to .38 for the correlation of the amount of course work In 
foreign languages with scores on the French Test. In general, the correlations 
of Achievement Test scores with this covarlate are the lowest of the correlation 
coefficients of Interest. 
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Table 7 and Figures 6 and 8 contain information summarizing the 
correlational analyses carried out for the remaining two administrations at 
which only 10 Achievement Tests are given (January and June). Examination of 
the data in these tables and figures indicates that the pattern of correlations 
discussed for the November data remains relatively unchanged, with a few 
exceptions, for tHe data obtained at the January administration of the tests. 

Insert Figures 6 and 8 about here 

The data collected for the June administration of the tests, however, 
exhlbl^Sa considerable number of discrepancies when compared to the November and 
January data. Correlations obtained for Achievement Test scores with SAT-V and 
SAT-M scaled scores are similar to those observed for the November and January 
administration. Correlations of Achievement Test scores with amount of course 
work in a particular subiect area are somewhat lower, however, than the November 
and January correlations (ranging from .05 to .31); whereas, those obtained for 
Achievement Test scores with course grades are somewhat higher than the November 
and January correlations (ranging from .35 to .51), Correlations of Achievement 
Test score with Background Questionnaire responses obtained for the June data 
are the lowest observed across the three administrations (November , January and 
June) . 

Correlation coefficients for scores on the 14 Achievement Tests that are 
given at the December and May administrations with the selected covarlates are 
summarized in Table 7 and Figures 5 and 7. Examination of the data obtained at 
the December administration (Table 7 and Figure 5) indicates that correlations 
of Achievement Test scores with SAT-V scaled scores range from .82 for the 
Literature Test to .37 for the German Test. Achievement Test scores showing 
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correlations with this covariate of ,70 or greater are English Composition, 
Literature, American History and Social Studies, and Biology. It should be 
noted that the European History and World Cultures Test also shows a high 
correlation with this variable (.69). Correlations of Achievement Test scores 
with SAT-M scaled scores range from a low of .29 for the German Test to a high 
of .82 for the Mathematics Level I Test. 

An examination of the correlations of Achievement Test scores with 
Achievement Test Background Questionnaire responses (Table 7 and Figure 5) shows 
that the correlations range from .29 for the Latin Test to .44 for the French 
and Spanish Tests. Correlations of Achievement Test scores with course grades 
range from a low of .27 for the German Test to a high of .47 for scores obtained 
on the Mathematics Level I Test. 



Finally, perusal of the correlations obtained for Achievement Test scores 
with amount of course work in a specific subject area indicate that these 
correlations range from .08 for the English Composition Test to .34 for the 
French Test. 

Comparisons of the correlation coefficients computed using data from the 
December administration with those obtained using the May data (Table 7 and 
Figure 5) show considerable similarity between the correlation coefficients 
computed using scores from these two administrations. In general, the 
correlations based on scores from the spring administration appear to be 
slightly higher than those obtained using the fall data. An exception to this 
generalization are correlations of Achievement Test scores with Background 
Questionnaire responses, which seem to be slightly higher for all tests given In 
December except the German Test. 



Insert Figure 5 about here 
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Insert Figure 7 about here 



Across Test Comparisons; Covarlate-Selectlon Analyses 

The results of the regression analyses that will be compared for the various 
Achievement Tests are summarized In Table 8 and Figure 9. Figure 9 shows plots 
of values obtained for selected regression analyses carried out for the total 
group taking 13 Achievement tests (R^ values for the Hebrew Test are not 
included) at five administrations. 

As mentioned earlier In this report, the results of the regression analyses 
should be interpreted with caution. First, although only total group results 
(based on large samples) are presented in this section, the results are sample 
dependent and may not generalize to the total population of Achievement Test 
takers. Secondly, different conclusions may have been reached If the analyses 
were not completely exploratory; l.e.. If the traditional covarlates used to 
scale Achievement Tests (SAT-V, SAT-M, and, In some cases, semesters of study) 
were Initially specified as variables to be Included In the regression analyses. 

The 13 tests (all tests except Hebrew) are represented In Figure 9 by the 
same two character symbols used to plot values of the correlation coefficients 
found In Figures 4-8. The R^ values summarize the results of what was ludged to 
be the most promising model for a particular test. Listed below are the models 
used to provide the R^ values plotted In Figure 9. Reasons for choosing these 
models will be presented In the next section of this report. It should be noted 
that no discussion of the regression analyses conducted for the Hebrew Test will 
take place here. Reasons for omitting the Hebrew Test from this discussion will 
also be presented In the next section of this report. 
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Test 



Best Model 



English Composition 
Literature 

American History and Social Studies 

Mathematics Level I 

Mathematics Level II 

Biology 

Chemistry 

Physics 

French 

Spanish 

European History and World Cultures 

Ge rman 

Latin 



Three variable model 
Three variable model 
Three variable model 
Two variable model 



Two variable model 



Three variable model 
Three variable model 
Three variable model 
Four variable model 
Four variable model 
Three variable model 
Four variable model 
Four variable model 



An examination of the information presented in Table 8 and Figure 9 
indicates that, for the November analyses, the highest R value was obtained for 
the two variable model fit to the Mathematics Level I data. The smallest 



value obtained was for the four variable model fit to the Spanish Test data. It 
is encouraging that, of the 10 tests administered in December, models can be fit 
to data obtained for six of these tests that account for at least 60% of the 
variance in Achievement Test scores. The test; showing the poorest results (in 
terms of the total variance accounted for) are the foreign language tests and 
(with the exception of the Biology Test) the science tests. 



Results of the analyses carried out for the tests administered in January 
and June are similar to those obtained using November data, i.e., the tests 
showing the poorest regression analysis results or the lowest percentage of 
variance accounted for, are the foreign language tests and the science tests. 
Still four of the tests administered in Januarv and five of the tests 
administered in June could be fit by models which accounted for at least 60% of 
the total variance in Achievement Test scores. 



Insert Figure 9 about here 
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The analyses carried out for the tests given at the December and May 

administrations indicate that the science tests, the foreign language tests, and 

the European History and World Cultures Test provide the poorest regression 

analysis results in terms of total variance accounted for. The German Test 

2 

appears to provide particularly poor results, showing R values of .30 and .37 

2 

for the December and May data, respectively. The R values obtained for the 
Mathematics Level I and Level IT Tests, the American History and Social Studies 
Test, the Literature Test, and the English Composition Test are all 
encouragingly high, indicating that it is possible to fit regression models to 
these data that account for at least 60% of the total variance in Achievement 
Test scores. 

DISCUSSION AND CONCLUSIONS 

The purpose of the study described in this report was to explore the 
relationships between College Board Achievement Test scores and potential 
scaling covariates for various subgroups of the test taking population. It was 
expected that such an exploration would lead to the following: 

o The selection of additional scaling covarlates that might provide 
Improved scaling results for those tests that do not provide scores 
correlating highly with SAT-V and/or SAT-M scores; 
o The respeclf Icatlon of the sample of students that are used to scale the 
test; and 

o The respeclf Icatlon of the hypothetical scaling population. 

The goal of the present study was to provide several alternative scaling 
procedures for the Achievement Tests. The procedures will, by their nature, 
vary components such as scaling covarlates, scaling samples, and characteristics 
of the hypothetical reference group. 
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This section of the report will be organized in the following manner. 

First, the characteristics of the scaling sample will be discussed. Secondly, 
the choice of additional scaling covarlates and their relationship to clusters 
of tests will be examined. Third, the characteristics of the hypothetical 
reference group will be discussed. 

Characteristics of Scaling Sample 

The current method of scaling the Achievement Tests employs a scaling sample 
based on high school juniors and seniors. The juniors are selected from those 
students who take the tests at the May and June administrations and the seniors 
are selected from students who take the tests at the November, December, and 
January administrations.. 

Several trends were observed in the section of the report that summarized 
the results of the grade level analyses. First, it appears that, as has 
traditionally been the situation, the tests are taken predominately by high 
school seniors at the November, December, and January administrations and by 
juniors at the May and June administrations. There are, however, some 
exceptions to this rule. For one, the Biology Test is taken predominately by 
sophomores at the June administration. Also, the Literature, Physics, and 
Mathematics Level II Tests are taken by almost equal numbers of juniors and 
seniors at the May administration. 

Given the fact that the basic test taking patterns (seniors taking the tests 
in the fall and juniors in the spring) do not seem to have changed greatly over 
the past several decades, the question is, is there a reason for revising the 
specifications that are presently used to draw the scaling samples? 

One method of discriminating among grade level groups that might provide 
scores for use in the scaling is to examine the similarity of regression models 
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fit to the data obtained for the various grade level groups taking the tests 
across the five administrations. Table 9 provides a summary of the variables 
that constitute the best two-, three-, or four-variable regression models fit to 
the junior and senior data collected across the five administrations. The 
decision regarding which regression model is most appropriate for a particular 
test (the models summarized in Table 9) will be discussed later in this section 
of the report. 

Insert Table 9 about here 

Examination of the information provided in Table 9 indicates that, for most 
of the tests, the greatest consistency among variables specified for a 
particular regression model is attained for seniors taking the tests in November 
and December and high school juniors taking the tests in May and June. For 
example, the Information contained in Table 9 for the English Composition Test 
indicates that the best three variable model fit to the senior data obtained at 
the November and December administrations of the test is consistent with the 
best three variable model fit to the scores obtained by juniors taking the test 
at the May and June administrations. 

The consistency observed for the regression models fit to the English 
Composition Test scores is greater than that observed for some of the other 
tests; however, it does appear that there is a general trend for models fit to 
fall senior scores and spring junior scores to be consistent across the 

* 

November, December, May and June administrations. 

An examination of the Information presented in Table 9 for junior and senior 
groups taking the test in January, indicates that, generally, the models 
specified for the January senior groups are more consistent with those observed 
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for other administrations than are the models specified for the junior January 
groups. 

2 

Values of R are also presented in Table 9 and were considered in making 

decisions regarding pooling data across grade levels and administrations. These 

values did not vary substantially for grade level /administration combinations 

2 

associated with a particular test. The R values did vary considerably (as 
expected) across the various tests. 

Given the above f landings, it seems reasonable to conclude that the sampling 
pattern traditionally used to select Achievement Test examinees for scaling 
studies is probably still appropriate. An additional question that remains is 
whether or not scores obtained by high school sophomores at the June 
administrations of the tests should be Included in the scaling sample. There 
are arguments against including these candidates. For one, not all tests have a 
large enough sophomore population to be considered as potential candidates for 
the scaling sample. This inconsistency in sampling by grade level across tests 
could possibly effect the scaling of the tests in some unknown way. 

Secondlngly, the number of sophomores taking a particular Achievement Test with 
scores on relevant covariates may be quite small. 

One possibility for enlarging the pool of sophomore scores for some tests 
might be to estimate sophomore scores on the missing covarlates. The current 
scaling method uses a simple linear regression procedure to estimate missing 
SAT-V and SAT-M scores. This procedure could be used to estimate other missing 
covarlates as well, and might provide a way in which complete data could be 
obtained for sophomore scores, and thus these scores could be sampled for tests 
that are taken by a large proportion of sophomores in June. However, once 
estimates were obtained for scores on missing covarlates, regression analyses 
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performed using sophomore scores (actual and estimated) obtained at the June 
administration of the test would need to be examined carefully to ensure that 
the relationship of sophomore Achievement Test scores to the salient covarlates 
was consistent with the relationship displayed by other grade levels and tests 
Included In the scaling sample for the cluster. 

An examination of the Information provided in Table 9 for those tests that 
are administered only twice a year does not yield data that show a great deal of 
consistency across December senior and May junior data. The exception to this 
is the Latin Test data, which are fit consistently with the same four variable 
model for all grade level and administration combinations. 

To summarize, the practice of sampling high school seniors taking the tests 
in November, December, and January, and juniors taking the tests In May and June 
seems appropriate. An alternative to this practice is related to the Inclusion 
of high school sophomores taking the tests in June; this alternative would 
include June sophomores, for some test clusters, if missing covariate scores can 
be successfully estimated and if the regressions show that Achievement Test 
scores obtained by this group have a similar relationship to the relevant 
covarlates as that displayed by scores obtained by other sampling groups. 

Additional Scaling Covarlates 

The traditional method of scaling the Achievement Tests uses scores on SAT-V 
and SAT-M as scaling covarlates for all 14 tests. Tn addition, semesters of 
study (as assessed by Achievement Test Background Questionnaire responses) is 
used as a covariate for scaling foreign languages tests. 

Both Braun and Tucker (see Dorans, 1985) have pointed to problems with the 
traditionally used scaling method that are related to choice of covarlates. 
Basically, the results of both their studies indicated that when ability has a 
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major impact on test selection (as it does with the Achievement Teats), there is 
a clear need to rescale the tests. The quality of this scaling will depend, 
however, on how highly the covariates used to rescale the tests correlate with 
Achievement Test performance. 

As previously noted, in the sections of this report that discuss the 
correlational analyses, the various Achievement Tests correlate quite 
differently with scores on the traditionally used covariates, SAT-V and SAT-M. 

A number of tests contain scores that are highly related to either scores on 

SAT-V or SAT-M. Examples of these tests are the English Composition Test, the 

Literature Test, and the Mathematics Level II Test. On the other hand, some 
tests provide scores that do not show a very strong relationship to scores 
obtained on either SAT-V or SAT-M. The foreign language tests show particularly 
low correlations with scores on these variables. 

In order to determine what additional covariates might be added to the model 
used to scale the Achievement Tests, correlations of test scores with the 17 
potential covarlates were examined. And, more importantly, stepwise i:egressions 
were carried out. The stepwise regressions were used to answer questions such 

as: if SAT-V and SAT-M scores are used to scale Achievement Test scores, will 

the scaling improve if semesters of study are added as an additional covariate? 
In addition, the results of the stepwise regression analyses were compared to 
determine if several tests could be clustered together, l.e., tests within a 
cluster show a similar relationship to a common core of covariates and, hence, 
can all be rescaled using these covariates. It should be noted that, in 
choosing tests to form the clusters, practical considerations were considered 
very important, l.e., efforts were made to keep tests together that formed 
logical clusters such as science, foreign languages, etc. 
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As noted in a previous section of this report, it should be pointed out that 
the stepwise regressions carried out for this study were completely exploratory; 
i.e., the variables shown for the specific models were those that maximized the 
percentage of variance accounted for in a particular data set. A logical 
procedure for specifying the regression models could have been used. If this 
had been the case, variables used for the traditional method of scaling, SAT-V, 
SAT-M, and in some cases, semesters of study, would have been Included in the 
models prior to the search for additional covarlates. 

An additional point that was noted previously, and should be noted here, is 
the sample dependency of the regression analyses that provided the particular 
models that will be discussed here. Because the samples that were used for this 
study (although large) may not be representative of the Achievement Test 
population, the results of the study should be cross validated (using a second 
sample of Achievement Test takers) to see if the regression models generalize 
across samples. 

The results of the regression analyses that were carried out for the total 
groups taking the 14 tests are summarized in Table 10. Only the results of the 
best two-, three-, and four-variable models are presented. First, examine the 
information presented for the English Composition Test. Examination of the 
three-variable models specified for this test (across the five administrations) 
indicates that the models are reasonably consistent and Include variable 1 
(SAT-V scaled scores), variable 2 (SAT-M scaled scores) and variable 12 (grades 
in foreign languages) . 

It is apparent that if the values of R (the percentage of variance in 

Achievement Test score accounted for by the particular regression model) are 

2 

compared for the two-, three-, and four-variable models, some gain in R occurs 
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when the models are expanded from two- to three— variable models. However, very 
little gain accrues when the models are further expanded to include four 
variables. Hence, one would conclude that probably the best results that can be 
expected, given the 17 available covariates, are results that can be gained 
using the best three-variable model. 

Insert Table 10 about here 

The most logical test to Include in a cluster with the English Composition 
Test is the Literature Test. Examination of the information provided in Table 
10 for the Literature Test Indicates that the best three-variable model 
specified across the five administrations is one that consistently Includes two 
variables, variable 1 (SAT-V scaled scores) and variable 10 (grades in English), 
so there is at least some consistency with the best three-variable model 
specified for the English Composition Test, although not quite as much 
consistency as one would hope for. This first grouping of tests, English 
Composition and Literature, will be referred to as Cluster 1, the English 
Cluster. 

Next, consider the information presented in Table 10 that pertains to the 

American History and Social Studies Test and the European History and World 

2 

Cultures Test. A comparison of R values across the best two-, three-, and 
four-variable models for these two tests indicates that the most appropriate 
model to select for the two tests is probably a three-variable model. The best 
three-variable model is specified somewhat consistently across the seven 
administrations of the two tests (presented in Table 10) and is one that 
Includes variables 1, 2, and 15 (SAT-V and SAT-M scores, and grades in Social 
Studies, respectively). The second grouping of tests, referred to as the 
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History Cluster, is one that consists of the American History and Social Studies 
Test and the European History and World Cultures Test. 

The procedure used to form Clusters 1 and 2, the English Cluster and the 
History Cluster, was followed in the formation of three additional clusters. 
Cluster 3 Is referred to as a Mathematics Cluster and contains the Mathematics 
Level T and Level II Tests. Cluster 4 is the Science Cluster and contains the 
Biology, Chemistry, and Physics Tests. The final cluster. Cluster 5, is the 
Languages Cluster and it contains the French, Spanish, and Latin Tests. 

It should be noted that neither the German Test nor the Hebrew Test were 
placed in clusters. This is because the results of the regression analyses 
carried out for these tests were Inconsistent with those carried out for the 
other language tests and also because the percentage of variance accounted for 
by the four-variable model specified for the German data was quite low. A 
decision to Include these tests in the Language Cluster would have to be made on 
a purely pragmatic basis and comparability of scores on these tests with the 
remaining language tests would be clearly questionable. 

The decision not to Include the Hebrew Test in the Language Cluster does not 
have particularly serious implications. This is because the test is currently 
being extensively revised and it is anticipated that the revised test will 
provide data that may be more consistent with that obtained for the other 
language tests. The decision to exclude the German Test from the Language 
Cluster does have serious implications and should be considered further. It is 
Important to determine, if possible, what there is about the German Test data 
that results in such low correlations of test score with potential covarlates. 
It may be possible that if larger samples were used, perhaps data accumulated 
across five testing years. Improved results might be obtained. Both of these 
factors will be investigated in the second phase of this study. 
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Table 11 contains a summary of the correlations of Achievement Test scores 
with the scaling covariates for the proposed clusters. An examination of the 
information displayed for the tests Included in the English Cluster indicates 
that all tests correlate highly with SAT-V scaled scores, reasonably highly with 
SAT-M scaled scores and moderately with grades in the respective subject areas. 
Similar conclusions can be drawn regarding tests comprising the History Cluster. 
However, it should be noted that the European History and World Cultures Test 
scores show a lower correlation with the covarlates than the American History 
and Social Studies Test scores. This is particularly true of scores obtained at 
the December administration of the test. 

Insert Table 11 about here 



Cluster 3, the Mathematics Cluster, contains the Mathematics Level I and II 
Tests. Proposed covarlates for these two tests are variable 2 (SAT-*M scaled 
scores) and variable 17 (Achievement Test Background Questionnaire responses). 
High correlations with variable 2 and moderate correlations with variable 17 are 
displayed by the Mathematics Level I and Mathematics Level II data summarized in 
Table 11. 

Proposed covarlates for Cluster 4, the Science Cluster, are variable 1 
(SAT-V scaled scores), variable 2 (SAT-M scaled scores), and variables 7 or 8 
(amount of course work In biology and sciences or amount of course work In 
physical sciences, respectively). The Chemistry and Physics Tests show 
correlations with the covarlates that are slightly more similar to each other 
than they are to the correlations displayed by the Biology Test scores. Scores 
obtained on the Biology Test have a tendency to be a little more highly 
correlated with SAT-V scores and a little less highly correlated with SAT-M 
scores than scores obtained on the other two tests In the cluster. 
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Cluster 5 contains the three language tests, French, Spanish and Latin. The 
lower correlations of all of the language tests with variables 1 and 2 (SAT-V 
and SAT-M scores, respectively! Is Immediately apparent. The additional 
covarlate (In addition to those used by the current procedure) proposed for this 
cluster Is variable 1? (grades In foreign languages) . All of the tests show a 
fairly strong correlation with this variable. Also, reference to the 
Information provided In Table 10 shows that expansion of the regression model 
from two to four variables provides a considerable Increase In the percentage of 
variance accounted for by the model. 

To summarize, the tests were sorted Into five possible scaling clusters 
based on the re;sults of the stepwise regression analyse. which are summarized 
In Table 10, as well as on logical considerations. The results of the 
clusterings are, to some extent, confirmed bv an examination of the correlations 
that are summarized In Table 11. It Is suggested that, based on the results of 
this portion of the study, two alternative scaling procedures be Investigated. 
Alternative One (single stage scaling) would consist of scaling the tests as 
five Independent clusters (using as covarlates the variables used to define the 
clusters) . No attempt would be made to establish any type of relationship among 
the scores obtained on tests that are members of different clusters. Alterna- 
tive Two (two stage scaling) would consist of a two-stage process. The first 
stage would consist of scaling the clusters Independently, as has been suggested 
for Alternative One. The second stage would use the results of the cluster 
scalings and scale the tests again, using SAT-V and SAT-M scores as common 
covarlates for all 14 tests. 
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Hypothetical Scaling Population 

The hypothetical reference population that is used in the current 
Achievement Test scaling procedure is assumed to have a scaled score mean of 500 
on SAT-V and SAT-M, a scaled score standard deviation of 100 on both these 
variables, and a correlation between scores obtained on SAT-V and SAT-M of .60. 
As mentioned previously. It was pointed out by Braun (see Dorans, 1985) that 
these values Influence the scaling results. The results of Braun’s studies 
indicated that choice of reference population is of critical Importance. Braun 
found that different reference groups produced different Achievement Test scale 
alignments. 

Examination of the Information presented in Table 6 and Figures 2 and 3 
indicate that, in general, SAT-V scaled score means for the total groups that 
take the Achievement Tests at the five administrations range between roughly 500 
and 575* SAT-M scaled score means obtained by the same groups range, roughly, 
between 525 and 675. The total group correlation between scores on these two 
tests (Table 2) range between .44 and .69 with the majority of the correlation 
coefficients falling somewhere in the vicinity of .55. 

No mention has been made thus far in this report of the magnitude of the 
scaled score standard deviations obtained on SAT-V and SAT-M by the various 
groups who take the Achievement Tests. Examination of the information provided 
in Table 6 indicates that SAT-V scaled score standard deviations obtained by the 
total groups taking the 14 tests range between 91 and 117. Scaled score 
standard deviations on SAT-M obtained for the same groups range between 77 and 
111 . 

It is fairly clear, from the information presented above, that SAT-V and 
SAT-M means obtained by the groups taking the 14 tests are quite diverse* This 
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result strongly emphasizes the need for scaling and for making eve»-y effort to 
improve the quality of scaling operations. It is also apparent that the group 
of students who are currently taking the Achievement Tests have mean SAT-V and 
SAT-M scores that differ substantially from the value of 500 for both scores 
assumed In the definition of the standard reference group. Because 
substantially all Achievement Test takers take the English Composition Test, a 
reasonable approximation to the mean score can be obtained by calculating the 
average score on the basis of data In Table 6. As It happens, the mean for 
SAT-V is 514 and the mean for SAT-M is 568. An alternative estimate is provided 
by results reported In National Report; High School Seniors, 1986 published by 
the College Board. In calculating these means, the SAT scores of each student 
who took the Achievement Tests In the cohort of 1986 seniors was counted once 
and only once. For SAT-V, the mean is 577 and for SAT-M, the mean Is 576. 

These results Indicate that consideration should be given to redefining the 
standard reference group so that Its statistical characteristics would be more 
consistent with the characteristics of the group currently taking those tests. 
Because a change In the definition might result In a serious dlscontlnultv In 
the score scale for some of the 14 tests, care should be taken In making this 
change to minimize Its effects on the form-to-form comparability of scores. 

Table 17 contains summary statistics for scores on the selected covariates 
obtained by the total groups constituting the five proposed scaling clusters. 
Perusal of the Information In Table 12 Indicates that a great deal of diversity 
of scores on the respective covarlates also exists within the five clusters. 

Insert Table 12 about here 
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A number of possibilities exist for alternatives to the way in which the 
hypothetical reference group is currently specified. Given the diversity of the 
scores on the covariates obtained by the various groups, an empirical approach 
might be the most feasible. This approach could be one that Involves pooling 
scores on the various covariates and using the pooled means, standard 
deviations, etc., as the values to specify for the hypothetical reference group. 
Pooled covariate data could be used within clusters, if a single stage scaling 
procedure were used or first, within, and second, across clusters, if a two 
stage scaling procedure was viewed as desirable. 

Suggested Revisions to the Traditional Scaling Methodology 

As mentioned previously, the traditionally used Achievement Test scaling 
method consists of the following components: 

o Samples are selected from seniors taking the tests at the November, 

December, and January administrations and juniors taking the tests at the 
May and June administrations. 

o The hypothetical reference population is specified to have a scaled score 
mean of 500 and a standard deviation of 100 on both SAT-V and SAT-M and a 
correlation between scores on these two variables of .60. 
o SAT-V and SAT-M scaled scores are used as common covariates for all tests 
and semesters of study are used as additional unique covariates for the 
foreign language tests. 

As a result of the analyses carried out for this study, some revisions to 
the traditionally used methodology have been suggested based on empirical 
evidence . 
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Sampling Achievement Test Scores 

It Is suggested that the traditional method of defining the scaling sample 
(selecting high school seniors who took the tests In November, December, and 
January, and high school juniors who took the tests In May and June) be 
augumented by selecting high school sophomores taking the tests In June, for 
those tests where sophomores comprise a large proportion of June test takers. 

Hypothetical Reference Group 

It Is suggested that the hypothetical reference group be defined empiri- 
cally, If a single stage scaling approach Is to be used, five hypothetical 
reference groups could be specified. Each group could be specified by pooling 
data across the covarlates on a within cluster basis. If a two stage scaling 
approach Is to be used, six hypothetical reference groups could be specified. 
First, five groups could be defined as described for the single stage scaling 
procedure and secondly, the sixth group could be defined by pooling data for the 
common covarlates (SAT-V and SAT-M scores) that are used across the five clusters. 

Scaling Models 

As alluded to previously In this report. It Is suggested that two scaling 
procedures be evaluated: 1) a single stage scaling procedure, and 2) a two 

stage scaling procedure. The single stage scaling procedure would Involve 
scaling tests solely on a within cluster basis; l.e., different covarlates, 
different reference populations, and possibly even different sampling procedures 
would be used for each of the five previously described clusters. The end 
result would be Achievement Test scales that would be unique to each cluster. 

The two stage scaling procedure builds on the single stage procedure; l.e., 
the scaled scores obtained from the single stage scalings are used as Input to 
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the second stage scaling. In this way, an attempt is made to define a single 
scale that will provide roughly comparable scores across the five clusters. 

The purpose of the study described in this report was to explore the 
relationships between College Board Achievement Test scores and potential 
scaling covariates. In addition, the characteristics of the various Achievement 
Test populations were examined to evaluate the traditionally used sampling 
procedures and the manner in which the reference population is defined. 

Although exploratory in nature, several underlying assumptions governed the 
direction of the study. One assumption was that test taking patterns may have 
changed over the yeairs both as a result of natural academic changes in the 
population and as a result of changes in ATP test administration schedules. For 
this reason, it was thought that the manner in which the scaling sample is 
selected required review. The results of the scaling study indicate that 
dramatic changes have not really taken place in the test taking patterns and, 
with minor revisions, the method traditionally used for selecting scaling 
samples is most probably still viable. 

A second asstmiption, based on the results of Tucker and Braun's studies (see 
Dorans, 1985) was that Improved scaling results might be obtained if the 
specifications for the reference population used for the scaling were revised to 
reflect more closely the characteristics of the current Achievement Test 
population. Empirical evidence collected in this study indicated that. Indeed, 
the current population (however you wish to define it) has SAT-V and SAT-M 
stimmary statistics that are quite different from those traditionally used for 
the reference scaling population. This is not necessarily bad. The issue is: 
will Improved scaling results be obtained (at least better within cluster 
comparability of scores) if multiple reference populations, or a single 
reference population, based on empirical data are used? 
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Finally, a very important assumption underlving the current study was: 
given that the traditionally used covariates that are common across the 14 tests 
produce test scales that are of unknown comparability, definition of separate 
sets of covariates for tests that appear to have a similar relationship with 
these covariates, might provide Improved scaling results. These results would 
be in the form of Improved comparability of scales for tests Included in a 
cluster. 

The regression analyses carried out for this stage of the study did suggest 
some tentative clusterings of tests. However, a number of weaknesses of the 
analyses have already been alluded to. For one, a new version of the SDO and 
also additional Background Questionnaires have been Introduced in ATP since data 
were collected for this study. It is quite likely that the results of the study 
might have been different if these additional data had been available. Also, 
scores for the Test of Standard Written English (TSWE) were not available at the 
time data were collected for this study. Tt would be interesting to investigate 
the use of these scores as an additional covarlate for tests that show a strong 
relationship to SAT-V scores. Finally, as mentioned previously, the results of 
the regression analyses are sample dependent, hence, suggested clusterings 
should be considered as tentative and subiect to cross validation. 

In conclusion, it is reconnnended that further study be carried out. It is 
suggested that particular care be paid to evaluating the new SDO, Background 
Questionnaires, and TSWE scores as potential covarlates. It is also recommended 
that the tentative clusters suggested here be cross validated. Furthermore, it 
would seem Important to Include, as one of the goals of future studies, a 
thorough investigation of problems related to the German Test such that ways in 
which the test might be included in the Language Cluster could be delineated. 
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Table 1 



Variable No. 

1 . 

2 . 

3 . 

4. 

5. 

6 . 

7. 

8 . 

9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 

16. 

17. 



Descriptions of Variables Used as Covariates 
in the Correlation and Regression Studies 



Description 

SAT-Verbal scaled scores 

SAT -Mathematical scaled scores 

High school class rank 

Amount of high school course work in English 

Amount of high school course work in Mathematics 

Amount of high school course work in foreign languages 

Amount of high school course work in Biology and Sciences 

Amount of high school course work in Physical Sciences 

Amount of high school course work in Social Studies 

Latest year and/or midyear grade received in English 

Latest year and/or midyear grade received in Mathematics 

Latest year and/or midyear grade received in foreign languages 

Latest year and/or midyear grade received in Biology and 
Sciences 

Latest year and/or midyear grade received in Physical Sciences 

Latest year and/or midyear grade received in Social Studies 

Number of honors or awards received during high school years 

Amount of training in a subject as assessed by Achievement Test 
Background Questionnaires (see Figure 3) 




Table 1 (continued) 



Variable No. 



Description 



1 



18. Highest level of education you plan to complete beyond high 
school 

19 . Highest level of education completed by your father or male 
guardian 

20. Highest level of education completed by your mother or female 
guardian 

21. Parental income 



^Tables 2 and 3 contain text of the actual questions responded to by examinees 
that supplied information for variables 3-17. 



Table 2 



Selected Questions from Student Descriptive Questionnaire (SDQ) 
Used to Collect Covariate Information for Scaling Study 



Variable SDQ 

No . No . 



Text of Question 



3 



5 What is your most recent high school class rank? (For 

example, if you are 15th in a class of 100, you are in the 
second tenth.) If you do not know your rank or rank is not 
used in your school, give your best estimate. 



(A) 


Highest tenth 




(D) 


Middle 


fifth 


(B) 


Second tenth 


top fifth 


(E) 


Fourth 


fifth 


(C) 


Second fifth 




(F) 


Lowest 


fifth 



4-9 



4 

5 



6-11 Questions 6 through 11 ask you to blacken the letter 

corresponding to the total years of study you expect to 
complete in certain subject areas. Include in the total only 
courses you have taken since beginning the ninth grade and 
those you expect to complete before graduation from high 
school. Count less than a full year in a subject as a full 
year. Do not count a repeated year of the same course as an 
additional year of study. 

(A) One year or the equivalent 

(B) Two years or the equivalent 

(C) Three years or the equivalent 

(D) Four years or the equivalent 

(E) More than four years or the equivalent 

(F) I will not take any courses in the subject area. 

6 English 

7 Mathematics 



6 

7 

8 



8 Foreign Languages 

9 Biological Sciences (for example, biology, botany, or zoology) 

10 Physical Sciences (for example, chemistry, physics, or earth 

science) 



131 



Table 2 (continued) 



Variable SDQ 

No . No . 



Text of Question 



9 11 Social Studies (for example, history, government, or 

geography) 

10-15 12-17 For each of the subject areas in questions 12 through 17, 

blacken the latest year-end or midyear grade you received 
since beginning the ninth grade. For example, if you are a 
senior and have not taken biology or any other biological 
science since your sophomore year, indicate that year -end 
grade. If you are a junior and have completed the. first half 
of the year in an English course, indicate that midyear grade. 
If you received the grade in an advanced, accelerated, or 
honors course, also blacken the letter H. 

(A) Excellent (usually 90-100 or A) 

(B) Good (usually 80-89 or B) 

(C) Fair (usually 70-79 or C) 

(D) Passing (usually 60-69 or D) 

(F) Failing (usually 59 or below or F) 

(G) Only "pass-fail" grades were assigned and I received a 
pass. 

(H) The grade reported was in an advanced, accelerated, or 
honors course . 



10 


12 


English 


11 


13 


Mathematics 


12 


14 


Foreign Languages 


13 


15 


Biological Sciences 


14 


16 


Physical Sciences 


15 


17 


Social Studies 


16 


23 


During your high school years how many honors or awards (for 
example, essay contest, debating tournament, science fair, 
music, art or theater competition, or membership in a 
scholastic honors group) have you received? 



(A) None (B) One or two (C) Three or four 
(D) Five or six (E) Seven or more 



Table 2 (continued) 



Variable SDQ 

No. No. 



Text of Question 



18 24 What is the highest level of education you plan to complete 

beyond high school? 

(A) A two-year specialized training program (for example, 
electronics or laboratory technician) 

(B) A two-year Associate of Arts degree (A. A.) 

(C) Bachelor's degree (B.A. or B.S.) 

(D) Master's degree (M.A. or M.S.) 

(E) Doctor's or other professional degree (such as M.D. or 
Ph.D.) 

(F) Other or undecided 

19 39 Indicate the highest level of education completed by your 

father or male guardian. 

(A) Grade school 

(B) Some high school 

(C) High school diploma 

(D) Buisness or trade school 

(E) Some college 

( F) Bachelor ' s degree 

(G) Some graduate or professional school 

(H) Graduate or professional degree 



20 


40 


Using the list in question 39, indicate the highest level of 
education completed by your mother or female guardian. 


21 


43 


What was the approximate income of your parents before taxes 
last year? Include taxable and nontaxable income from all 






sources . 



(A) Less than $3,000 a year (about $57 a week or less) 

(B) Between $3,000 and $5,999 a year (from $58 to $114 a 
week) 

(C) Between $6,000 and $8,999 a year (from $115 to $173 a 
week) 

(D) Between $9,000 and $11,999 a year (from $174 to $230 a 
week) 

(E) Between $12,000 and $14,999 a year (from $231 to $288 a 
week) 

(F) Between $15,000 and $17,999 a year (from $289 to $346 a 
week) 

(G) Between $18,000 and $20,999 a year (from $347 to $403 a 
week) 



Table 2 (continued) 



Variable SDQ 

No. No. Text of Question 




Table 3 



Achievement Test Background Questionnaires Used to 
Collect Covariate Information for Scaling Study 



Mathematics Level I and Level II Tests 



To provide information on your training in mathematics, please answer the two 
questions below. For each question, indicate your answer by blackening one of 
the specified spaces in the row of nine spaces labeled Q. (You will need to 
blacken at most TWO of the spaces labeled Q.) The information that you provide 
is for statistical purposes only and will not influence your score on the test . 

Question 1 

Which of the following best describes the college preparatory 
program you have taken or are taking? If you are currently 
enrolled in an Advanced Placement Mathematics course (Calculus) 

AB or Calculus BC) , blacken space 1, leave spaces 2 through 5 
blank, and go on to Question II, If not, please read the 
statements below and then blacken at most one of spaces 2 
through 5 as your answer. If none of the descriptions 
appropriately describes the mathematics cours'es you have taken 
or are taking, leave spaces 1 to 5 blank. 

Full-year courses (or their equivalents) 
elementary algebra (Algebra I), geometry, 
and intermediate algebra (Algebra II); 
at least one semester of elementary 
functions, mathematical analysis, or 
advanced algebra (Algebra III); and 
trigonometry as a separate course for 
at least one semester, with a grade for 

each mathematics course of C or better. ■* Blacken space 2. 

Full-year (or their equivalents) 
in elementary algebra (Algebra I), 
geometry, and intermediate algebra 
(Algebra II); at least one semester 
elementary functions, mathematical 
analysis, or advanced algebra 
(Algebra III); and trigonometry 
as part of these courses, with a 
grade for each mathematics course 
of C or better. 



- Blacken space 3 , 



Table 3 (continued) 



Mathematics Level I and Level II Tests (continued) 



Full -year courses (or their equivalents) 
in elementary algebra (Algebra I), 
geometry, and intermediate algebra 
(Algebra II) ; and at least one semester 
of elementary functions, mathematical 
analysis, or advanced algebra (Algebra 
III), with a grade for each mathematics 

course of C or better. - Blacken space 4. 

Full-year courses (or their equivalents) 
in elementary algebra (Algebra I), 
geometry, and intermediate algebra 
(Algebra II) , with a grade for each 

mathematics course of C or better. - Blacken space 5. 



Question II 



Which of the following best describes how long it has been 
since you took a course in plane geometry? Blacken only 
one of spaces 6 through 9 as your answer to Question II. 



I have never taken a course in plane 
geometry . 

One school year or less. 

More than one school year but less 
than two school years. 

More than two school years. 



- Blacken space 6 . 

- Blacken space 7. 

- Blacken space 8. 

- Blacken space 9. 



O 
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Table 3 (continued) 



Physics Test 



To provide information on training in physics, please answer the question below. 
Indicate your answer by blackening one of the first four spaces labeled Q on your 
answer sheet. Your response will not influence your test score. 



Question 1 

How many semesters of instruction have you had in a high school physics course or 
courses? (There are two semesters in a school year. If you are taking physics 
in the current semester and the semester is more than half over , count this 
current semester as a full semester.) 



Space 1 



Space 2 



One semester 
Two semesters 
Three semesters 
Space 4 : Four semesters 

Spaces 5 to 9 : (Leave blank) 



Space 3 



O 
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Table 3 (continued) 



French Test 



1 



In the group of nine spaces labeled Q, you are to blacken ONE and ONLY ONE space, 
as described below, to indicate how you obtained your knowledge of French. The 
information that you provide is for statistical purposes only and will not 
influence your score on the test . 

Question 1 



If your knowledge of French does 
not come primarily from courses 
taken in grades 9 through 12, 
blacken space 9 and leave the 
remaining spaces blank, regard- 
less of how long you studied the 
subject in school. For example, 
you are to blacken space 9 if 
your knowledge of French comes 
primarily from any of the 
following sources: study prior 

to the ninth grade, courses 
taken at a college, or special 
study, residence abroad, or 
living in a home in which French 
is spoken. 



If your knowledge of French does 
come primarily from courses taken 
in grades 9 through 12, blacken 
the space that indicates the 
level of the French course in 
which you are currently enrolled. 
If you are not now enrolled in a 
French course, blacken the space 
that indicates the level of the 
most advanced course in French 
that you have completed. 



Level I : 
Level II : 

Level III: 

Level IV: 



first or second half 

first half 

second half 

first half 

second half 

first half 

second half 



Advanced Placement or course 
that represents a level of 
study higher than Level IV: 
second half 



If you are in doubt about whether to mark 
1-8, mark space 9. 



- blacken space 1 

- blacken space 2 

- blacken space 3 

- blacken space 4 

- blacken space 5 

- blacken space 6 

- blacken space 7 

- blacken space 8 



space 9 rather than one of the spaces 



The same questionnaire (with the appropriate test name) appears in the French, 
German, Latin and Spanish Tests. The Latin questionnaire differs slightly in 
that the phrase, "...or living in a home in which [language] is spoken" is 
eliminated. 

138 



Table 4 



Specifications for Recoding Responses to Student Descriptive 
Questionnaire and Achievement Test Background Questionnaires 



Student Descriptive Questionnaire 



Question 

No. 



Question 



Response Code 



5 



6-11 



12-17 



23 



24 



High school class rank 



Amount of high school course work 



Latest year and/or midyear grade 
in specific subjects 



Honors or awards 



Highest level of planned education 



F 1 

E 2 

D 3 

C 4 

B 5 

A 6 

F 1 

A 2 

B 3 

C 4 

D 5 

E 6 

F 1 

D 2 

C&G 3 

B 4 

A 5 



H If H is marked, 
advance students 
code designation 
by one . 



A 

B 

C 

D 

E 



1 

2 

3 

4 

5 



A 

B 

C 

D 

E 

F 



1 

2 

3 

4 

5 

(delete) 
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Table 4 (continued) 



Student Descriptive Questionnaire 


Question 
















No. 


Question 


Response 




Code 




39 


Father's level of education 




A 






1 










B 






2 










C 






3 










D 






4 










E 






5 










F 






6 










G 






7 










H 






8 




40 


Mother's level of education 




A 






1 










B 






2 










C 






3 










D 






4 










E 






5 










F 






6 










G 






7 - 










H 






8 




43 


Parental income 


A 1 




F 


6 


K 


11 






B 2 




G 


7 


L 


12 






C 3 




H 


8 


M 


13 






D 4 




I 


9 


N 


14 






E 5 




J 


10 


0 


15 
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Table 4 (continued) 



Achievement Test Background Questionnaires 


Test 


Response 


Code 




Question 1 


Question 2 




French , Spanish , 


1 


- 


Removed from sample 


German and Latin 


2 


- 


3 




3 


- 


4 




4 


- 


5 




5 


- 


6 




6 


- 


7 




7 


- 


8 




8 


- 


9 




9 


- 


Removed from sample 


Math Level I and 


1 


6 


5 


Math Level II 


1 


7 


5 




1 


8 


5 




1 


9 


5 




2 


6 


4 




2 


7 


4 




2 


8 


4 




2 


9 


4 




3 


6 


3 




3 


7 


3 




3 


8 


3 




3 


9 


3 




4 


6 


2 




4 


7 


2 




4 


8 


2 




4 


9 


2 




5 


6 


1 




5 


7 


1 




5 


8 


1 




5 


9 


1 


Physics 


1 


- 


1 




2 


- 


2 




3 


- 


3 




4 


- 


4 
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SuiDmary Statistics by Grade Level and Administration 
Date for Fourteen Achievement Tests 
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Total Includes examinees who did not specify grade level. 

Suimary Statistics were computed for all available cases who had Achievement Test Scores. 
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Table 6 (continued) 

SAT“Verbal and SAT-Mathemat Leal Suinnary Statistics by Grade Level 
and Administration Date for Fourteen Achievement Tests 
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Table 8 (continued) 

Suimiary of Stepwise Regression Procedure 
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Summary of Stepwise Regression Procedure 
Bioiogy Test 

(continued) 
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Sufimary of Stepwise Regression Procedure 
Chemistry Test 

(continued) 
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Table 8 (continued) 

Sumnary of Stepwise Regression Procedure 
Spanish Test 
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Table 8 (continued) 

S^uninary of Stepwise Regression Procedure 
German Test 
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Summary of Stepwise Regression Procedure 
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T«bl« 9 



Sunnary of Bast Two- , Thraa-*, or Four'Variabla Rasrassion Modals ^ 
Specifiad by Grada Laval for Fiva Achiavaoant Tast Administrations 





Grade 




Test 


Level 


Noveoiber 


Eng. Comp. 


Junior 

R 

Senior 


1.2,6 

.76 

1,2,12 

.67 


Literature 


Junior 

Senior 


1,10,9 

.71 


Amer. Hist. 


J|jnior 






Sgnior 


1.2,15 

.61 


Math I 


Junior 

Senior 

R^ 


2,17 

.74 

2,17 

.72 


Math II 


J)jnior 






Sjnior 


2,17 

.62 


Biology 


Junior 






Senior 

R^ 


1,2,13 

.64 


Chemistry 


Jijnior 






Senior 


1,2,8 

.50 


Physics 


Junior 

R^ 

Senior 

R^ 


1.2,17 

.47 


French 


Junior 

R^ 

Senior 


1,2,17,6 

.51 


Spanish 


Jjpior 


17,7,9,15 




Senior 

r2 


1.2,17,12 

.38 


Eur. Hist. 


Junior 

R 

Senior 

R^ 


- 


German 


Junior 

Senior 

r2 


- 


Hebrew 


Jjjnior 


- 




Senior 


- 


Latin 


Junior 

R* 


- 



Admin istrat ion /Var jab las 



December 


Januarv 


May 


1,2,6 


1,2,21 


1,2,12 


.66 


.68 


.68 


1,2,12 


1,2,10 


1,2,10 


.60 


.65 


.63 

1.2,4 

.72 


1,3,10 


1,2,10 


1,6,10 


.68 


.71 


.69 

1,2,15 

.62 


1,2,15 


1,2,15 


1,8,15 


.60 


.57 


.61 


2,11 


2,8 


2,11 


.74 


.72 


.73 


2,17 


2,17 


2,17 


:8 


.66 


.66 


2,11 


2,17 


2,11 


.68 


.67 


.64 


2,17 


2,17 


2,17 


.62 


.65 


.67 


1,2,9 


1,20,21 


1.2,13 


.52 


.72 


.67 


1,2,7 


1,2,7 


1.2.7 


.60 


.54 


.57 


1.2,13 


2,5,21 


1,2,8 


.71 


.55 


.58 


1.2,8 


1.2,8 


1.2,8 


.55 


.52 


.56 

1.2.8 

.54 


1,2,17 


1.2,17 


1,2,17 


.53 


.53 


.47 


17,12,15,18 


1,17,9,21 


1,2,17,12 


.35 


.59 


.53 


1,2,17,12 


1,2,17,16 


1,17,6,16 


.43 


.40 


.55 


1.17,4,5 


7,12,15,16 


1,2,17,12 

.47 


1,17,12.21 


1,17,12,21 


1,2,17,12 


.34 


.26 


.33 




- 


1,2,15 

.54 


1.2,10 


- 


1.2,15 


.50 




.67 




- 


1.2.17,8 

.31 


1.17,6,12 


- 


2,17.7.21 


.30 


- 


.43 






1.2,17,12 



.36 



Juna 



1,2,12 

.66 

1,2,10 

.64 

1,2,4 

.69 

1.3,20 

.72 

1,2,15 

.63 

1,2,15 

.66 

2,11 

.71 

2,11 

.69 

2,11 

.62 

2,11 

.62 

1.2.13 
.57 

1.2.7 
.64 

1.2.14 
.55 

1 . 2.8 

.62 

1 . 2,8 

.53 

1.2,17 

.54 

1.2.17.12 
.46 

1.17.6.12 
.54 

1.2.17.12 
.43 

1.2.17.12 
.45 



Tabu 10 



Baat Two, Thraa or Four VarlabU Ragraaalon HodjU for 
Total Groupa Taking tha Achlavaomnt Taata 



Engllah Coopoaltlon Taat 



Adtalnlatratlon 





Novaoibar 


Dacaobar 


January 


May 


June 


2 Var. Modal 


Var. 1,2 


Var. 1,12 


Var. 1,2 


Var. 1,2 


Var. 1,2 




.66 


.59 


.6A 


.67 


.65 


3 Var. Modal 


Var. 1,2,12 


Var. 1,2,12 


Var. 1,2.10 


Var. 1,2,12 


Var. 1,2,12 




.67 


.60 


.65 


.68 


.66 


A Var. Modal 


Var. 1,2,6,10 


Var. 1,2,6.10 


Var. 1,2,10,12 


Var. 1,2,6,10 


Var. 1,2,6,10 


r2 


.67 


.60 


.65 


.68 


.66 


N 


37,188 


121,770 


53,877 


12.977 


68,5A7 


Lltaratura Taat 








Admlnlatratlon 








Novambar 


Decaober 


January 


May 


June 


2 Var. Model 


Var. 1,10 


Var. 1,10 


Var. 1,10 


Var. 1,10 


Var. 1,10 


r" 


.71 


.68 


.71 


.70 


.70 


3 Var. Model 


Var. 1,10,19 


Var. 1,3,10 


Var. 1,2,10 


Var. 1,10,16 


Var. 1,3,10 


r2 


.71 


.68 


.71 


.71 


.70 


A Var. Model 


Var. 1,3,10,19 


Var. 1,2,10,16 


Var. 1,2,10,16 


Var. 1,2,10,16 


Var. 1,3,8,10 


r2 


.71 


.68 


.71 


.71 


.70 


N 


A, 392 


12,721 


6,A99 


1,256 


5,078 




Aroarlcan Hlatory and Social Studlaa 


Taat 










Admlnlatratlon 








Movanbar 


Dacanbar 


January 


May 


June 


2 Var. Modal 


Var. 1,15 


Var. 1,15 


Var. 1,15 


Var. 1,15 


Var. 1,15 




.60 


.58 


.56 


.63 


.61 


3 Var. Modal 


Var. 1,2,15 


Var. 1,2,15 


Var. 1.2,15 


Var. 1,2,15 


Var. 1,2,15 


r2 


.61 


.60 


.57 


.65 


.63 


A Var. Modal 


Var. 1.2,10,15 


Var. 1,2.9,15 


Var. 1,2,8,15 


Var. 1,2,9,15 


Var. 1,2,9,15 


r2 


.61 


.60 


.57 


.66 


.6A 


N 


7,533 


17,061 


9,800 


2,836 


23,1A6 
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T«bl« 10 (continutd) 



Best Two, Three or Four Variable Resresslon Models for 
Total Groups Taking the Achievement Tests'*^ 







Mathematics 


Level I Test 






Administration 




November 


December 


January 


May 


June 


2 Var. Model 


Var. 2,17 


Var. 2,17 


Var. 2,17 


Var. 2,17 


Var. 2,17 




.72 


.68 


.67 


.71 


.72 


3 Var. Model 


Var, 2,17,11 


Var. 7,17,11 


Var. 2,17,11 


Var. 2,17,11 


Var. 1,2,11 


r" 


.73 


.69 


.67 


.71 


.72 


A Var. Model 


Var. 2,17,8,11 


Var. 1,2,17,11 


Var. 1,2,17,11 


Var. 2,17,5,11 


Var. 2,17,5,11 


r" 


.73 


.70 


.68 


.72 


.73 


N 


25,153 


83,3A1 


A2,275 


10,AA8 


61,096 


Mathematics Level II Test 








Administration 








November 


December 


January 


May 


June 


2 Var. Model 


Var. 1,17 


Var. 2,17 


Var. 2,17 


Var. 2,11 


Var. 2,11 


r" 


.63 


.62 


.66 


.73 


.65 


3 Var. Model 


Var. 2,17,11 


Var. 2,17,11 


Var. 1,2,17 


Var. 1,2,17 


Var. 1,2,11 




.63 


.62 


.66 


.73 


.65 


A Var. Model 


Var. 1,2,17,11 


Var. 1,2,17,11 


Var. 1,2,17,11 


Var. 1,1,17,11 


Var. 1,2,17,11 


r" 


.63 


.63 


.66 


.73 


.65 


N 


11,332 


25,690 


12,67A 


A, 033 


18,755 


Biology Test 








Administration 








November 


December 


January 


May 


June 


2 Var. Model 


Var. 1,2 


Var. 1,2 


Var. 1,2 


Var. 1,2 


Var. 1,2 


r" 


.62 


.58 


.57 


.6A 


.56 


3 Var. Model 


Var. 1,2,13 


Var. 1,2,7 


Var. 1,2,7 


Var. 1,2,13 


Var. 1,2,13 


r" 


.63 


.60 


.59 


.65 


.58 


A Var. Model 


Var. 1,2,7,13 


Var. 1,2,7,13 


Var. 1,2,7,13 


Var. 1,2,7,13 


Var. 1,2,7,13 


r" 


.6A 


.60 


.60 


.66 


.58 


N 


A, 336 


13,3A9 


7,628 


1,A00 


18,88A 



o 

ERIC 
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T«bl« XO (continued) 



Bttt Two, Thr«« or Four Vari«bl« Rogrostion Models for 
Totol Groups Taking tha Achiavaoant Tasts 



Chanistry Tast 


Administration 




Movambar 


Dacaivbar 


January 


May 


June 


2 Var. Model 


Var. 1,2 


Var. 1,2 


Var. 1,2 


Var. 1,2 


Var. 1,2 




.47 


.53 


.48 


.56 


.53 


3 Var. Model 


Jk.te* 

Var. 1,7,8 


Var. 1,7,8 


Var. 1,2,8 


Var. 1,2,8 


Var. 1,2,8 




.50 


.55 


.52 


.59 


.55 


A Var. Model 


Var. 1,2,8,14 


Var. 1,2,8,U 


Var. 1,2,8,14 


Var. 1,2,8,14 


Var. 1,2,8,14 


r2 


.51 


.56 


.53 


.60 


.56 


M 


5,773 


14,112 


7,412 


1,635 


24,384 


Physics Tast 








Administration 








Novambar 


Dacaflibar 


January 


May 


June 


2 Var. Model 


Var. 1,2 


Var. 1,2 


Var. 1,2 


Var. 1,2 


Var. 1,2 


r2 


.43 


.48 


.49 


.51 


.53 


3 Var. Modal 


Var. 1,2,17 


Var. 1,2,17 


Var. 1,2,17 


Var. 1,2,8 


Var. 1,2,8 




.47 


.53 


.53 


.54 


.54 


4 Var. Modal 


Var. 1,2,17,8 


Var. 1,2,17,8 


Var. 1,2,17,8 


Var. 1,2,17,8 


Var. 1,2,8,14 


r2 


.48 


.54 


.54 


.55 


.54 


M 


3,040 


8,118 


6,867 


1,114 


8,065 


Franch Tast 








Administration 








Movambar 


Dacambar 


January 


May 


June 


2 Var. Modal 


Var. 1,17 


Var. 1,17 


Vsr. 1,17 


Var. 1,17 


Var. 1,17 




.48 


.39 


.37 


.48 


.40 


3 Var. Modal 


Var. 1,2,17 


Var. 1,17,12 


Var. 1,2,17 


Var. 1,17,6 


Var. 1,17,12 




.49 


.41 


.39 


.51 


.44 


h Var. Modal 


Var. 1,2,17,6 


Var. 1,2,17,12 


Var. 1,2,17,6 


Var. 1,17,6,12' 


Var. 1,2,17,12 


r2 


.50 


.43 


.40 


.53 


.45 


N 


2,891 


11,344 


3,927 


1,491 


8,101 
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T«bl« 10 (continuad) 



Bast Two, Thrss or Four Vsrisbia Ragrassion Models for 
Total Groups Taking tha Achiavaaant Tasts 



Spanish Tast 


Adninistration 




Novanbar 


Dacaobar 


January 


May 


Juna 


2 Var. Modal 


Var. 1,17 


Var. 1,17 


Var. 17,12 


Var. 1,17 


Var. 1,17 




.36 


.31 


.22 


.33 


.37 


3 Var. Modal 


Var. 1,17,12 


Var. 1,17,12 


Var. 1,17,12 


Var. 1,17,12 


Var. 1,17,12 




.38 


.34 


.24 


.37 


.41 


4 Var. Modal 


Var. 1.2,17, 


12 Var. 1,17,12,21 


Var. 1,17,12,21 


Var. 1,2,17,12 


Var. 1,2,17,12 


r2 


.39 


.34 


.26 


.39 


.42 


N 


3,373 


11,436 


4,725 


1,446 


8,676 


Europaan History and World Culturas Tast 








Administration 








Novambar 


Dacaobar 


January 


May 


Juna 


2 Var. Modal 




Var. 1,2 




Var. 1,2 




r2 




.49 




.57 




3 Var. Modal 




Var. 1,2,10 




Var. 1,2,15 








.49 




.59 




4 Var. Modal 




Var. 1,2,10,15 




Var. 1,2,9,15 




r2 




.51 




.60 




N 




3,010 




697 




Gaman Tast 








Administration 








Novafobar 


Dacaobar 


January 


May 


Juna 


2 Var. Modal 




Var. 1,17 




Var. 2,17 




r2 




.24 




.32 




3 Var. Modal 




Var. 1,17,6 




Var. 2,17,6 




r2 




.28 




.35 




4 Var. Modal 




Var. 1,17,6,12 




Var. 2,17,6,8 




r2 




.30 




.37 




N 




2,519 




461 
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Tabl« 10 (continued) 



B«at Two» Thxaa or Four Varlabla Ragraaalon Hodala for 
Total Oroupa Taking tha Achlavaaant Taata^ 



Habraw 


Adttinlatratlon 




Novaadsar 


Dacaobar January 


May 


Juna 


2 Var. Modal 




Var. 2,19 










.31 






3 Var. Modal 




Var. 2,16,19 










.40 






4 Var. Modal 




Var. 2,17,12,19 






r2 




.47 






. N 
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Latin Taat 


Adniniatration 




Movanbar 


Dacaobar January 


May 


Juna 


2 Var. Modal 




Var. 1,2 


Var. 1,2 




r2 




.35 


.36 




3 Var. Modal 




Var. 1,2,17 


Var. 1,2,17 




r2 




.39 


.41 




4 Var. Modal 




Var. 1,2,17,12 


Var. 1,2,17,12 




r2 




.42 


.43 




H 




1,948 


738 




S^ariabla namaa 


ara found 


in Tabla 1. 
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Table 11 



Correlation of Scaling Covarlates with Achievement Test 
Score for Proposed Scaling Clusters 



Cluster One (English) 



Administrations 





Nov. 


Dec. 


Jan. 


May 


June 


English Composition Test 












Var 1 


.81 


.76 


.79 


.81 


.80 


Var 2 


.56 


.50 


.54 


.62 


.59 


Var 12 


.29 


.31 


.28 


.36 


.35 


Literature Test 










C 


Var 1 


.84 


.82 


.85 


.83 


.84 


Var 2 


.54 


.49 


.52 


.55 


.55 


Var 10 


.36 


.38 


.40 


.39 


.42 



Cluster Two (History) 



American History and 
Social Studies Test 












Var 1 


.76 


.75 


.73 


.77 


.77 


Var 2 


.53 


.50 


.50 


.60 


.59 


Var 15 


.41 


.39 


.38 


.51 


.46 


European History and 
World Cultures Test 












Var 1 




.69 




.75 




Var 2 




.46 




.51 




Var 15 




.31 




.42 
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Table 11 (continued) 



Correlation of Scaling Covariates with Achievement Test 
Score for Proposed Scaling Clusters 



Cluster Three (Mathematics) 







Administrations 






Nov. 


Dec. 


Jan. 


May 


June 


Mathematics 


Level I 












Test 














Var 2 




.84 


.82 


.81 


.83 


.84 


Var 17 




.44 


.42 


.43 


.35 


.31 


Mathematics 


Level II 












Test 














Var 2 




.79 


.78 


.81 


.83 


.79 


Var 17 




.32 


.33 


.34 


.30 


.19 



Cluster Four (Science) 





Administrations 




Nov. 


Dec. 


Jan. 


May 


June 


Biology Test 












Var 1 


.75 


.72 


.72 


.74 


.67 


Var 2 


.61 


.62 


.62 


.70 


.64 


Var 7 


.28 


.21 


.24 


.27 


.17 


Chemistry Test 












Var 1 


.57 


.58 


.55 


.60 


.59 


Var 2 


.63 


.68 


.64 


.70 


.67 


Var 8 


.35 


.31 


.35 


.35 


.22 


Physics Test 












Var 1 


.50 


.53 


.54 


.58 


.57 


Var 2 


.62 


.63 


.63 


.67 


.68 


Var 8 


.30 


.30 


.32 


.30 


.23 
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Table 11 (continued) 

Correlation of Scaling Covariates with Achievement Test 
Score for Proposed Scaling Clusters 



Cluster Five (Languages) 



Administrations 





Nov. 


Dec . 


Jan. 


May 


June 


French Test 












Var 1 


.58 


.51 


.44 


.54 


.55 


Var 2 


.45 


.40 


.37 


.48 


.48 


Var 17 


.49 


.44 


.42 


.51 


.41 


Var 12 


.30 


.33 


.27 


.39 


.38 


Spanish Test 












Var 1 


.46 


.39 


.23 


.45 


.51 


Var 2 


.33 


.30 


.17 


.40 


.42 


Var 17 


.49 


.44 


.41 


.45 


.40 


Var 12 


.33 


.31 


.28 


.40 


.39 



Latin Test 
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Figure 1 



Achievement Test Scaled Score Means for the Total Groups 
Taking the Tests at the Five Administrations 
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SAT-V Scaled Score Mean 




Achievement Test Administration 
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SAT-M Scaled Score Mean 



Figure 3 



SAT-M Scaled Score Means for the Total Groups Taking 
the Tests at the Five Administrations 
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Score with Covariate 



Figure 4 



Correlations of Achievement Test Scores with Selected 
Covariates for November Achievement Test Administration 




November Achievement Test Administration 
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Correlation of ACH Test Score with Covariate 



Figure 5 



Correlations of Achievement Test Scores with Selected 
Covariates for December Achievement Test Administration 
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SAT-V SAT-M Background Course Amount of 

Scaled Scaled Question Grades Coursework 

Score Score Responses 

December Achievement Test Administration 
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Correlation of ACH Test Score with Covariate 



Figure 6 



Correlations of Achievement Test Scores with Selected 
Covariates for January Achievement Test Administrations 




SAT-V SAT-M Background Course Amount of 
Scaled Scaled Question Grades Coursework 
Score Score Responses 



January Achievement Test Administration 



Correlation of ACH Test Score with Covoriote 



Figure 7 



Correlations of Achievement Test Scores with Selected 
Covariates for May Achievement Test Administration 



1 

0 . 9 
0 . 8 
0 . 7 
0 , 6 
0 . 5 
0 . 4 
0 -. 3 
0 . 2 
0 . 1 
0 




May Achievement Test Administration 



o 

ERIC 



221 



Correlation of ACH Test Score with Covariate 
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Figure 8 

Correlations of Achievement Test Scores with Selected 
Covariates for June Achievement Test Administration 




June Achievement Test Administration 
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for Best Two- Three- or Four- Variable Model 



Figure 9 



2 

R Values for Best Two- , Three- , and Four-Variable Models 
for Fourteen Achievement Tests 
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